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Chapter 1 

About This Book 


The MIPS32™ Architecture For Programmers Volume II comes as a multi-volume set. 

• Volume I describes conventions used throughout the document set, and provides an introduction to the MIPS32™ 
Architecture 

• Volume II provides detailed descriptions of each instruction in the MIPS32™ instruction set 

• Volume III describes the MIPS32™ Privileged Resource Architecture which defines and governs the behavior of the 
privileged resources included in a MIPS32™ processor implementation 

• Volume IV-a describes the MIPS16e™ Application-Specific Extension to the MIPS32™ Architecture 

• Volume IV-b describes the MDMX™ Application-Specific Extension to the MIPS32™ Architecture and is not 
applicable to the MIPS32™ document set 

• Volume IV-c describes the MIPS-3D™ Application-Specific Extension to the MIPS64™ Architecture and is not 
applicable to the MIPS32™ document set 

• Volume IV-d describes the SmartMIPS™Application-Specific Extension to the MIPS32™ Architecture 


1.1 Typographical Conventions 

This section describes the use of italic, bold and courier fonts in this book. 

1.1.1 Italic Text 

• is used for emphasis 

• is used for bits, fields, registers, that are important from a software perspective (for instance, address bits used by 
software, and programmable fields and registers), and various floating point instruction formats, such as S, D, and PS 

• is used for the memory access types, such as cached and uncached 

1.1.2 Bold Text 

• represents a term that is being defined 

• is used for bits and fields that are important from a hardware perspective (for instance, register bits, which are not 
programmable but accessible only to hardware) 

• is used for ranges of numbers; the range is indicated by an ellipsis. For instance, 5..1 indicates numbers 5 through 1 

• is used to emphasize UNPREDICTABLE and UNDEFINED behavior, as defined below. 

1.1.3 Courier Text 

Courier fixed-width font is used for text that is displayed on the screen, and for examples of code and instruction 
pseudocode. 
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Chapter 1 About This Book 


1.2 UNPREDICTABLE and UNDEFINED 

The terms UNPREDICTABLE and UNDEFINED are used throughout this book to describe the behavior of the 
processor in certain cases. UNDEFINED behavior or operations can occur only as the result of executing instructions 
in a privileged mode (i.e., in Kernel Mode or Debug Mode, or with the CPO usable bit set in the Status register). 
Unprivileged software can never cause UNDEFINED behavior or operations. Conversely, both privileged and 
unprivileged software can cause UNPREDICTABLE results or operations. 


1.2.1 UNPREDICTABLE 

UNPREDICTABLE results may vary from processor implementation to implementation, instruction to instruction, or 
as a function of time on the same implementation or instruction. Software can never depend on results that are 
UNPREDICTABLE. UNPREDICTABLE operations may cause a result to be generated or not. If a result is generated, 
it is UNPREDICTABLE. UNPREDICTABLE operations may cause arbitrary exceptions. 

UNPREDICTABLE results or operations have several implementation restrictions: 

• Implementations of operations generating UNPREDICTABLE results must not depend on any data source (memory 
or internal state) which is inaccessible in the current processor mode 

• UNPREDICTABLE operations must not read, write, or modify the contents of memory or internal state which is 
inaccessible in the current processor mode. For example, UNPREDICTABLE operations executed in user mode 
must not access memory or internal state that is only accessible in Kernel Mode or Debug Mode or in another process 

• UNPREDICTABLE operations must not halt or hang the processor 


1.2.2 UNDEFINED 

UNDEFINED operations or behavior may vary from processor implementation to implementation, instruction to 
instruction, or as a function of time on the same implementation or instruction. UNDEFINED operations or behavior 
may vary from nothing to creating an environment in which execution can no longer continue. UNDEFINED operations 
or behavior may cause data loss. 

UNDEFINED operations or behavior has one implementation restriction: 

• UNDEFINED operations or behavior must not cause the processor to hang (that is, enter a state from which there is 
no exit other than powering down the processor). The assertion of any of the reset signals must restore the processor 
to an operational state 


1.3 Special Symbols in Pseudocode Notation 

In this book, algorithmic descriptions of an operation are described as pseudocode in a high-level language notation 
resembling Pascal. Special symbols used in the pseudocode notation are listed in Table 1-1. 


Table 1-1 Symbols Used in Instruction Operation Statements 


Symbol 

Meaning 

<- 

Assignment 

=, * 

Tests for equality and inequality 

II 

Bit string concatenation 


A y-bit string formed by y copies of the single-bit value x 
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1.3 Special Symbols in Pseudocode Notation 

Table 1-1 Symbols Used in Instruction Operation Statements 


Symbol 

Meaning 

b#n 

A constant value n in base b. For instance 10#100 represents the decimal value 100, 2#100 represents the binary 
value 100 (decimal 4), and 16#100 represents the hexadecimal value 100 (decimal 256). If the "b#" prefix is 
omitted, the default base is 10. 

x y..z 

Selection of bits y through z of bit string x. Little-endian bit notation (rightmost bit is 0) is used. If y is less than 
Z, this expression is an empty (zero length) bit string. 

+, - 

2’s complement or floating point arithmetic: addition, subtraction 

*, X 

2’s complement or floating point multiplication (both used for either) 

div 

2’s complement integer division 

mod 

2’s complement modulo 

/ 

Floating point division 

< 

2’s complement less-than comparison 

> 

2’s complement greater-than comparison 

< 

2’s complement less-than or equal comparison 

> 

2’s complement greater-than or equal comparison 

nor 

Bitwise logical NOR 

xor 

Bitwise logical XOR 

and 

Bitwise logical AND 

or 

Bitwise logical OR 

GPRLEN 

The length in bits (32 or 64) of the CPU general-purpose registers 

GPR[x ] 

CPU general-purpose register*. The content of GPR[0] is always zero. 

SGPR[s,x] 

In Release 2 of the Architecture, multiple copies of the CPU general-purpose registers may be implemented. 
SGPR[s,x] refers to GPR set s, register x. GPR[x] is a short-hand notation for SGPR[ SRSCtl css , x]. 

FPR[x] 

Floating Point operand register x 

FCC[CC] 

Floating Point condition code CC. FCC[0] has the same value as COC[ 1 ]. 

FPR[x] 

Floating Point (Coprocessor unit 1), general register x 

CPR[z,x,s] 

Coprocessor unit z, general register x, select s 

CP2CPR[x] 

Coprocessor unit 2, general register x 

CCR[z,x] 

Coprocessor unit z, control register x 

CP2CCR[x] 

Coprocessor unit 2, control register x 

COC[z] 

Coprocessor unit z condition signal 

Xlat[x] 

Translation of the MIPS16e GPR number x into the corresponding 32-bit GPR number 

BigEndianMem 

Endian mode as configured at chip reset (0 ^Little-Endian, 1 — » Big-Endian). Specifies the endianness of the 
memory interface (see LoadMemory and StoreMemory pseudocode function descriptions), and the endianness 
of Kernel and Supervisor mode execution. 


The endianness for load and store instructions (0 — > Little-Endian, 1 — » Big-Endian). In User mode, this 
BigEndianCPU endianness may be switched by setting the RE bit in the Status register. Thus, BigEndianCPU may be computed 
as (BigEndianMem XOR ReverseEndian). 
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Table 1-1 Symbols Used in Instruction Operation Statements 


Symbol 

Meaning 

ReverseEndian 

Signal to reverse the endianness of load and store instructions. This feature is available in User mode only, and 
is implemented by setting the RE bit of the Status register. Thus. ReverseEndian may be computed as (SRre and 
User mode). 

LLbit 

Bit of virtual state used to specify operation for instructions that provide atomic read-modify-write. LLbit is set 
when a linked load occurs; it is tested and cleared by the conditional store. It is cleared, during other CPU 
operation, when a store to the location would no longer be atomic. In particular, it is cleared by exception return 
instructions. 

I:, 

I+n:, 

I-n: 

This occurs as a prefix to Operation description lines and functions as a label. It indicates the instruction time 
during which the pseudocode appears to “execute.” Unless otherwise indicated, all effects of the current 
instruction appear to occur during the instruction time of the current instruction. No label is equivalent to a time 
label of I. Sometimes effects of an instruction appear to occur either earlier or later — that is. during the 
instruction time of another instruction. When this happens, the instruction operation is written in sections labeled 
with the instruction time, relative to the current instruction I, in which the effect of that pseudocode appears to 
occur. For example, an instruction may have a result that is not available until after the next instruction. Such an 
instruction has the portion of the instruction operation description that writes the result register in a section 
labeled 1+1. 


The effect of pseudocode statements for the current instruction labelled 1+1 appears to occur “at the same time” 
as the effect of pseudocode statements labeled I for the following instruction. Within one pseudocode sequence, 
the effects of the statements take place in order. However, between sequences of statements for different 
instructions that occur “at the same time," there is no defined order. Programs must not depend on a particular 
order of evaluation between such sections. 

PC 

The Program Counter value. During the instruction time of an instruction, this is the address of the instruction 
word. The address of the instruction that occurs during the next instruction time is determined by assigning a 
value to PC during an instruction time. If no value is assigned to PC during an instruction time by any 
pseudocode statement, it is automatically incremented by either 2 (in the case of a 16-bit MIPS16e instruction) 
or 4 before the next instruction time. A taken branch assigns the target address to the PC during the instruction 
time of the instruction in the branch delay slot. 

PABITS 

The number of physical address bits implemented is represented by the symbol PABITS. As such, if 36 physical 
address bits were implemented, the size of the physical address space would be 2 PABITS = 2 36 bytes. 


Indicates whether the FPU has 32-bit or 64-bit floating point registers (FPRs). In MIPS32. the FPU has 32 32-bit 
FPRs in which 64-bit data types are stored in even-odd pairs of FPRs. In MIPS64, the FPU has 32 64-bit FPRs 
in which 64-bit data types are stored in any FPR. 

FP32RegistersMode 

In MIPS32 implementations, FP32RegistersMode is always a 0. MIPS64 implementations have a compatibility 
mode in which the processor references the FPRs as if it were a MIPS32 implementation. In such a case 
FP32RegisterMode is computed from the FR bit in the Status register. If this bit is a 0, the processor operates 
as if it had 32 32-bit FPRs. If this bit is a 1, the processor operates with 32 64-bit FPRs. 


The value of FP32RegistersMode is computed from the FR bit in the Status register. 

InstructionlnB ranchD 
elaySlot 

Indicates whether the instruction at the Program Counter address was executed in the delay slot of a branch or 
jump. This condition reflects the dynamic state of the instruction, not the static state. That is, the value is false 
if a branch or jump occurs to an instruction whose PC immediately follows a branch or jump, but which is not 
executed in the delay slot of a branch or jump. 

SignalExceptionCexce 
ption, argument) 

Causes an exception to be signaled, using the exception parameter as the type of exception and the argument 
parameter as an exception-specific argument). Control does not return from this pseudocode function - the 
exception is signaled at the point of the call. 


1.4 For More Information 

Various MIPS RISC processor manuals and additional information about MIPS products can be found at the MIPS URL: 
http://www.mips.com 
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1.4 For More Information 


Comments or questions on the MIPS32™ Architecture or this document should be directed to 

Director of MIPS Architecture 
MIPS Technologies, Inc. 

1225 Charleston Road 
Mountain View, CA 94043 

or via E-mail to architecture@mips.com. 
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Chapter 2 

Guide to the Instruction Set 


This chapter provides a detailed guide to understanding the instruction descriptions, which are listed in alphabetical 
order in the tables at the beginning of the next chapter. 

2.1 Understanding the Instruction Fields 

Figure 2-1 shows an example instruction. Following the figure are descriptions of the fields listed below: 

• “Instruction Fields” on page 8 

• “Instruction Descriptive Name and Mnemonic” on page 9 

• “Format Field” on page 9 

• “Purpose Field” on page 10 

• “Description Field” on page 10 

• “Restrictions Field” on page 10 

• “Operation Field” on page 1 1 

• “Exceptions Field” on page 1 1 

• “Programming Notes and Implementation Notes Fields” on page 1 1 
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Instruction Mnemonic 
and Descriptive Name 

Instruction encoding ■ 
constant and variable 
field names and values 

Architecture level at 
which instruction was 
defined/redefined and 
assembler format(s) for 
each definition 


Example Instruction Name 





EXAMPLE 


31 26 

25 21 

20 

16 

15 11 

10 6 

5 0 


SPECIAL 

000000 

rs 

rt 

rd 

0 

0 0 0 0 0 

EXAMPLE 

000000 


-Format: example rd, rs,rt 


MIPS32 


Short description 


Purpose: to execute an EXAMPLE op 


Symbolic description 

Full description of 
instruction operation 



Description: rd rs exampleop rt 

This section describes the operation of the instruction in text, tables, and 
illustrations. It includes information that would be difficult to encode in the 
Operation section. 


Restrictions on . 
instruction and 
operands 


High-level language 

description of instruction 
operation 


Restrictions: 

This section lists any restrictions for the instruction. This can include values of the 
instruction encoding fields such as register specifiers, operand values, operand 
formats, address alignment, instruction scheduling hazards, and type of memory 
access for addressed locations. 

Operation: 

/* This section describes the operation of an instruction in a */ 

/* high-level pseudo-language. It is precise in ways that the */ 

/* Description section is not, but is also missing information */ 

/* that is hard to express in pseudocode.*/ 

temp <— GPR[rs] exampleop GPR[rt] 

GPR[rd]<— temp 


Exceptions that 

instruction can cause 


Exceptions: 

A list of exceptions taken by the instruction 


Notes for programmers ► Programming Notes: 

Information useful to programmers, but not necessary to describe the operation of 
the instruction 

Notes for implementors ^ 

► Implementation Notes: 

Like Programming Notes, except for processor implementors 


Figure 2-1 Example of Instruction Description 


2.1.1 Instruction Fields 

Fields encoding the instruction word are shown in register form at the top of the instruction description. The following 
rules are followed: 
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• The values of constant fields and the opcode names are listed in uppercase (SPECIAL and ADD in Figure 2-2). 
Constant values in a field are shown in binary below the symbolic or hexadecimal value. 

• All variable fields are listed with the lowercase names used in the instruction description ( rs , rt and rd in Figure 2-2). 

• Fields that contain zeros but are not named are unused fields that are required to be zero (bits 10:6 in Figure 2-2). If 
such fields are set to non-zero values, the operation of the processor is UNPREDICTABLE. 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL 




0 

ADD 


rs 

rt 

rd 



000000 




00000 

100000 


6 5 5 5 5 6 


Figure 2-2 Example of Instruction Fields 


2.1.2 Instruction Descriptive Name and Mnemonic 


The instruction descriptive name and mnemonic are printed as page headings for each instruction, as shown in Figure 
2-3. 


Add Word 


ADD 


Figure 2-3 Example of Instruction Descriptive Name and Mnemonic 


2.1.3 Format Field 

The assembler formats for the instruction and the architecture level at which the instruction was originally defined are 
given in the Format field. If the instruction definition was later extended, the architecture levels at which it was extended 
and the assembler formats for the extended definition are shown in their order of extension (for an example, see 
C.cond.fmt). The MIPS architecture levels are inclusive; higher architecture levels include all instructions in previous 
levels. Extensions to instructions are backwards compatible. The original assembler formats are valid for the extended 
architecture. 


Format: add rd, rs, rt MIPS32 

Figure 2-4 Example of Instruction Format 


The assembler format is shown with literal parts of the assembler instruction printed in uppercase characters. The 
variable parts, the operands, are shown as the lowercase names of the appropriate fields. The architectural level at which 
the instruction was first defined, for example “MIPS32” is shown at the right side of the page. 

There can be more than one assembler format for each architecture level. Floating point operations on formatted data 
show an assembly format with the actual assembler mnemonic for each valid value of the /mf field. For example, the 
ADD.fmt instruction lists both ADD.S and ADD.D. 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


9 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 



Chapter 2 Guide to the Instruction Set 

The assembler format lines sometimes include parenthetical comments to help explain variations in the formats (once 
again, see C.cond.fmt). These comments are not a part of the assembler format. 

2.1.4 Purpose Field 

The Purpose field gives a short description of the use of the instruction. 


Purpose: 

To add 32-bit integers. If an overflow occurs, then trap. 

Figure 2-5 Example of Instruction Purpose 


2.1.5 Description Field 

If a one-line symbolic description of the instruction is feasible, it appears immediately to the right of the Description 
heading. The main purpose is to show how fields in the instruction are used in the arithmetic or logical operation. 


Description: rd <- rs + rt 

The 32-bit word value in GPR rt is added to the 32-bit value in GPR rs to produce a 32-bit result. 

• If the addition results in 32-bit 2’s complement arithmetic overflow, the destination register is not modified and 
an Integer Overflow exception occurs 

• If the addition does not overflow, the 32-bit result is placed into GPR rd 

Figure 2-6 Example of Instruction Description 


The body of the section is a description of the operation of the instruction in text, tables, and figures. This description 
complements the high-level language description in the Operation section. 

This section uses acronyms for register descriptions. “GPR rt ” is CPU general-purpose register specified by the 
instruction field rt. “FPR/v” is the floating point operand register specified by the instruction field fs. “CPI register fdT 
is the coprocessor 1 general register specified by the instruction field fd. “ FCSR ” is the floating point Control /Status 
register. 


2.1.6 Restrictions Field 

The Restrictions field documents any possible restrictions that may affect the instruction. Most restrictions fall into one 
of the following six categories: 

• Valid values for instruction fields (for example, see floating point ADD.fmt) 

• ALIGNMENT requirements for memory addresses (for example, see LW) 

• Valid values of operands (for example, see DADD) 

• Valid operand formats (for example, see floating point ADD.fmt) 

• Order of instructions necessary to guarantee correct execution. These ordering constraints avoid pipeline hazards for 
which some processors do not have hardware interlocks (for example, see MUL). 

• Valid memory access types (for example, see LL/SC) 
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Restrictions: 

None 


Figure 2-7 Example of Instruction Restrictions 


2.1.7 Operation Field 

The Operation field describes the operation of the instruction as pseudocode in a high-level language notation 
resembling Pascal. This formal description complements the Description section; it is not complete in itself because 
many of the restrictions are either difficult to include in the pseudocode or are omitted for legibility. 


Operation: 

temp <- (GPR [rs] 31 | | GPR [rs] 31 _ _ 0 ) + (GPR [ rt ] 31 | | GPR [ rt ] 31 _ 0 ) 
if temp 32 ^ temp 31 then 

Signal Except ion ( IntegerOverf low) 
else 

GPR[rd] 4— temp 
endif 


Figure 2-8 Example of Instruction Operation 


See Section 2.2, "Operation Section Notation and Functions" on page 12 for more information on the formal notation 
used here. 


2.1.8 Exceptions Field 

The Exceptions field lists the exceptions that can be caused by Operation of the instruction. It omits exceptions that can 
be caused by the instruction fetch, for instance, TLB Refill, and also omits exceptions that can be caused by 
asynchronous external events such as an Interrupt. Although a Bus Error exception may be caused by the operation of a 
load or store instruction, this section does not list Bus Error for load and store instructions because the relationship 
between load and store instructions and external error indications, like Bus Error, are dependent upon the 
implementation. 


Exceptions: 

Integer Overflow 

Figure 2-9 Example of Instruction Exception 


An instruction may cause implementation-dependent exceptions that are not present in the Exceptions section. 


2.1.9 Programming Notes and Implementation Notes Fields 
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The Notes sections contain material that is useful for programmers and implementors, respectively, but that is not 
necessary to describe the instruction and does not belong in the description sections. 


Programming Notes: 

ADDU performs the same arithmetic operation but does not trap on overflow. 

Figure 2-10 Example of Instruction Programming Notes 


2.2 Operation Section Notation and Functions 

In an instruction description, the Operation section uses a high-level language notation to describe the operation 
performed by each instruction. Special symbols used in the pseudocode are described in the previous chapter. Specific 
pseudocode functions are described below. 

This section presents information about the following topics: 

• “Instruction Execution Ordering” on page 12 

• “Pseudocode Functions” on page 12 


2.2.1 Instruction Execution Ordering 

Each of the high-level language statements in the Operations section are executed sequentially (except as constrained 
by conditional and loop constructs). 


2.2.2 Pseudocode Functions 

There are several functions used in the pseudocode descriptions. These are used either to make the pseudocode more 
readable, to abstract implementation-specific behavior, or both. These functions are defined in this section, and include 
the following: 

• “Coprocessor General Register Access Functions” on page 12 

• “Load Memory and Store Memory Functions” on page 14 

• “Access Functions for Floating Point Registers” on page 16 

• “Miscellaneous Functions” on page 18 

2.2.2.1 Coprocessor General Register Access Functions 

Defined coprocessors, except for CPO, have instructions to exchange words and doublewords between coprocessor 
general registers and the rest of the system. What a coprocessor does with a word or doubleword supplied to it and how 
a coprocessor supplies a word or doubleword is defined by the coprocessor itself. This behavior is abstracted into the 
functions described in this section. 

COP_LW 

The COP_LW function defines the action taken by coprocessor z when supplied with a word from memory during a load 
word operation. The action is coprocessor-specific. The typical action would be to store the contents of memword in 
coprocessor general register rt. 
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COP_LW (z, rt, memword) 

z: The coprocessor unit number 

rt: Coprocessor general register specifier 

memword : A 32-bit word value supplied to the coprocessor 

/* Coprocessor-dependent action */ 

endf unction COP_LW 

Figure 2-11 COP_LW Pseudocode Function 


COP_LD 

The COP_LD function defines the action taken by coprocessor z when supplied with a doubleword from memory during 
a load doubleword operation. The action is coprocessor-specific. The typical action would be to store the contents of 
memdouble in coprocessor general register rt. 

COP_LD (z, rt , memdouble) 

z: The coprocessor unit number 

rt: Coprocessor general register specifier 

memdouble: 64-bit doubleword value supplied to the coprocessor. 

/* Coprocessor-dependent action */ 
endfunction COP_LD 

Figure 2-12 COP_LD Pseudocode Function 


COP_SW 

The COP_S W function defines the action taken by coprocessor z to supply a word of data during a store word operation. 
The action is coprocessor-specific. The typical action would be to supply the contents of the low-order word in 
coprocessor general register rt. 

dataword <— COP_SW (z, rt) 

z: The coprocessor unit number 

rt: Coprocessor general register specifier 

dataword: 32-bit word value 

/* Coprocessor-dependent action */ 

endfunction COP_SW 

Figure 2-13 COP_SW Pseudocode Function 


COP_SD 

The COP_SD function defines the action taken by coprocessor z to supply a doubleword of data during a store 
doubleword operation. The action is coprocessor-specific. The typical action would be to supply the contents of the 
low-order doubleword in coprocessor general register rt. 

datadouble <— COP_SD (z, rt) 

z: The coprocessor unit number 

rt: Coprocessor general register specifier 

datadouble : 64-bit doubleword value 

/* Coprocessor-dependent action */ 
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endfunction COP_SD 

Figure 2-14 COP_SD Pseudocode Function 


2.2.2.2 Load Memory and Store Memory Functions 

Regardless of byte ordering (big- or little-endian), the address of a halfword, word, or doubleword is the smallest byte 
address of the bytes that form the object. For big-endian ordering this is the most-significant byte; for a little-endian 
ordering this is the least-significant byte. 

In the Operation pseudocode for load and store operations, the following functions summarize the handling of virtual 
addresses and the access of physical memory. The size of the data item to be loaded or stored is passed in the 
AccessLengtli field. The valid constant names and values are shown in Table 2-1. The bytes within the addressed unit of 
memory (word for 32-bit processors or doubleword for 64-bit processors) that are used can be determined directly from 
the AccessLengtli and the two or three low-order bits of the address. 


AddressTranslation 


The AddressTranslation function translates a virtual address to a physical address and its cache coherence algorithm, 
describing the mechanism used to resolve the memory reference. 


Given the virtual address vAddr, and whether the reference is to Instructions or Data ( lorD ), find the corresponding 
physical address (pAddr) and the cache coherence algorithm (CCA) used to resolve the reference. If the virtual address 
is in one of the unmapped address spaces, the physical address and CCA are determined directly by the virtual address. 
If the virtual address is in one of the mapped address spaces then the TLB or fixed mapping MMU determines the 
physical address and access type; if the required translation is not present in the TLB or the desired access is not 
permitted, the function fails and an exception is taken. 


(pAddr, CCA) <— AddressTranslation (vAddr, IorD, LorS) 


/* 

pAddr: 

/* 

CCA: i 

/* 


/* 

vAddr: ■ 

/* 

IorD: 

/* 

LorS: 

/* 

See the 

/* 

type in 


Cache Coherence Algorithm, the method used to access caches*/ 
and memory and resolve the reference */ 


endfunction AddressTranslation 

Figure 2-15 AddressTranslation Pseudocode Function 


LoadMemory 

The LoadMemory function loads a value from memory. 

This action uses cache and main memory as specified in both the Cache Coherence Algorithm (CCA) and the access 
{IorD) to find the contents of AccessLengtli memory bytes, starting at physical location pAddr. The data is returned in a 
fixed-width naturally aligned memory element {MemElem). The low-order 2 (or 3) bits of the address and the 
AccessLengtli indicate which of the bytes within MemElem need to be passed to the processor. If the memory access type 
of the reference is uncached, only the referenced bytes are read from memory and marked as valid within the memory 
element. If the access type is cached but the data is not present in cache, an implementation-specific size and alignment 
block of memory is read and loaded into the cache to satisfy a load reference. At a minimum, this block is the entire 
memory element. 
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MemElem LoadMemory (CCA, AccessLength, pAddr, vAddr, IorD) 


/* 

/* 

/* 

/* 

/* 

/* 


MemElem: 


CCA: 


Data is returned in a fixed width with a natural alignment. The */ 
width is the same size as the CPU general-purpose register, */ 

32 or 64 bits, aligned on a 32- or 64-bit boundary, */ 
respectively. */ 

Cache Coherence Algorithm, the method used to access caches */ 
and memory and resolve the reference */ 


/* AccessLength: Length, in bytes, of access */ 

/* pAddr: physical address */ 

/* vAddr: virtual address */ 

/* IorD: Indicates whether access is for Instructions or Data */ 


endfunction LoadMemory 

Figure 2-16 LoadMemory Pseudocode Function 


StoreMemory 

The StoreMemory function stores a value to memory. 


The specified data is stored into the physical location pAddr using the memory hierarchy (data caches and main memory) 
as specified by the Cache Coherence Algorithm (CCA). The MemElem contains the data for an aligned, fixed-width 
memory element (a word for 32-bit processors, a doubleword for 64-bit processors), though only the bytes that are 
actually stored to memory need be valid. The low-order two (or three) bits of pAddr and the AccessLength field indicate 
which of the bytes within the MemElem data should be stored; only these bytes in memory will actually be changed. 


StoreMemory (CCA, AccessLength, MemElem, pAddr, vAddr) 


/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 

/* 


CCA: Cache Coherence Algorithm, the method used to access */ 

caches and memory and resolve the reference. */ 
AccessLength: Length, in bytes, of access */ 

MemElem: Data in the width and alignment of a memory element. */ 

The width is the same size as the CPU general */ 
purpose register, either 4 or 8 bytes, */ 
aligned on a 4- or 8-byte boundary. For a */ 
partial-memory-element store, only the bytes that will be*/ 
stored must be valid.*/ 
pAddr: physical address */ 

vAddr: virtual address */ 


endfunction StoreMemory 

Figure 2-17 StoreMemory Pseudocode Function 


Prefetch 

The Prefetch function prefetches data from memory. 

Prefetch is an advisory instruction for which an implementation-specific action is taken. The action taken may increase 
performance but must not change the meaning of the program or alter architecturally visible state. 

Prefetch (CCA, pAddr, vAddr, DATA, hint) 

/* CCA: Cache Coherence Algorithm, the method used to access */ 

/* caches and memory and resolve the reference. */ 

/* pAddr: physical address */ 
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/* vAddr: virtual address */ 

/* DATA: Indicates that access is for DATA */ 

/* hint: hint that indicates the possible use of the data */ 

endfunction Prefetch 

Figure 2-18 Prefetch Pseudocode Function 

Table 2-1 lists the data access lengths and their labels for loads and stores. 

Table 2-1 AccessLength Specifications for Loads/Stores 


AccessLength Name 

Value 

Meaning 

DOUBLEWORD 

7 

8 bytes (64 bits) 

SEPTIBYTE 

6 

7 bytes (56 bits) 

SEXTIBYTE 

5 

6 bytes (48 bits) 

QUINTIBYTE 

4 

5 bytes (40 bits) 

WORD 

3 

4 bytes (32 bits) 

TRIPLEBYTE 

2 

3 bytes (24 bits) 

HALFWORD 

1 

2 bytes (16 bits) 

BYTE 

0 

1 byte (8 bits) 


2.2.2.3 Access Functions for Floating Point Registers 

The pseudocode shown in below specifies how the unformatted contents loaded or moved to CP 1 registers are interpreted 
to form a formatted value. If an FPR contains a value in some format, rather than unformatted contents from a load 
(uninterpreted), it is valid to interpret the value in that format (but not to interpret it in a different format). 


ValueFPR 

The ValueFPR function returns a formatted value from the floating point registers. 

value <— ValueFPR ( fpr, fmt) 

/* value: The formattted value from the FPR */ 

/* fpr: The FPR number */ 

/* fmt: The format of the data, one of: */ 

/* S, D, W, L, PS, */ 

/* OB, QH, */ 

/* UNINTERP RETED_WORD , */ 

/* UNINTERP RETED_DOUBLEWORD */ 

/* The UNINTERPRETED values are used to indicate that the datatype */ 

/* is not known as, for example, in SWC1 and SDC1 */ 

case fmt of 

S, W, UNINTERP RETED_WORD : 
valueFPR <— FPR [fpr] 

D, UNINTERP RETED_DOUBLEWORD : 
if (FP32RegistersMode = 0) 

if (fpr 0 ^ 0) then 

valueFPR <- UNPREDICTABLE 
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else 

valueFPR 4— FPR [ fpr+1 ] 31 . , g II FPR [ fpr] 31 _ _ 0 
endif 
else 

valueFPR 4— FPR[fpr] 
endif 

L, PS : 

if (FP32RegistersMode = 0) then 
valueFPR 4- UNPREDICTABLE 

else 

valueFPR 4— FPR[fpr] 
endif 

DEFAULT : 

valueFPR 4- UNPREDICTABLE 

endcase 

endfunction ValueFPR 


Figure 2-19 ValueFPR Pseudocode Function 


StoreFPR 


The pseudocode shown below specifies the way a binary encoding representing a formatted value is stored into CPI 
registers by a computational or move operation. This binary representation is visible to store or move-front instructions. 
Once an FPR receives a value from the StoreFPR(), it is not valid to interpret the value with ValueFPRt ) in a different 
format. 


StoreFPR (fpr, fmt, value) 


/* 

/* 

/* 

/* 

/* 

/* 

/* 


fpr: The FPR number */ 

fmt: The format of the data, one of: */ 

S, D, W, L, PS, */ 

OB, QH, */ 

UNINTERP RETED_WORD , */ 

UNINTERP RETED_DOUBLEWORD */ 

value: The formattted value to be stored into the FPR */ 


/* The UNINTERPRETED values are used to indicate that the datatype */ 
/* is not known as, for example, in LWC1 and LDC1 */ 


case fmt of 

S, W, UNINTERP RETED_WORD : 

FPR[fpr] <— value 

D, UNINTERP RETED_DOUBLEWORD : 
if (FP32RegistersMode = 0) 
if (fpr 0 ^ 0) then 

UNPREDICTABLE 

else 

FPR [fpr] 4- UNPREDICTABLE 32 || value 31--0 

FPR [fpr+1] 4- UNPREDICTABLE 32 || value 63 . . 3 2 
endif 
else 

FPR [ fpr ] 4— value 
endif 
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L, PS : 

if (FP32RegistersMode = 0) then 

UNPREDICTABLE 

else 

FPR[fpr] <— value 
endif 

endcase 

endfunction StoreFPR 

Figure 2-20 StoreFPR Pseudocode Function 
2.2.2.4 Miscellaneous Functions 

This section lists miscellaneous functions not covered in previous sections. 

SyncOperation 

The SyncOperation function orders loads and stores to synchronize shared memory. 

This action makes the effects of the synchronizable loads and stores indicated by stype occur in the same order for all 
processors. 

SyncOperation (stype) 

/* stype: Type of load/store ordering to perform. */ 

/* Perform implementation-dependent operation to complete the */ 

/* required synchronization operation */ 

endfunction SyncOperation 

Figure 2-21 SyncOperation Pseudocode Function 


SignalException 

The SignalException function signals an exception condition. 

This action results in an exception that aborts the instruction. The instruction operation pseudocode never sees a return 
from this function call. 

SignalException (Exception, argument) 

/* Exception: The exception condition that exists. */ 

/* argument: A exception-dependent argument, if any */ 

endfunction SignalException 

Figure 2-22 SignalException Pseudocode Function 

SignalDebugBreakpointException 

The SignalDebugBreakpointException function signals a condition that causes entry into Debug Mode from non-Debug 
Mode. 

This action results in an exception that aborts the instruction. The instruction operation pseudocode never sees a return 
from this function call. 
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SignalDebugBreakpointException ( ) 

endf unction SignalDebugBreakpointException 

Figure 2-23 SignalDebugBreakpointException Pseudocode Function 

SignalDebugModeBreakpointException 

The SignalDebugModeBreakpointException function signals a condition that causes entry into Debug Mode from 
Debug Mode (i.e., an exception generated while already running in Debug Mode). 

This action results in an exception that aborts the instruction. The instruction operation pseudocode never sees a return 
from this function call. 

SignalDebugModeBreakpointException ( ) 

endf unction SignalDebugModeBreakpoint Except ion 

Figure 2-24 SignalDebugModeBreakpointException Pseudocode Function 

Nullify Currentlnstruction 

The NullifyCurrentlnstruction function nullifies the current instruction. 

The instruction is aborted, inhibiting not only the functional effect of the instruction, but also inhibiting all exceptions 
detected during fetch, decode, or execution of the instruction in question. For branch-likely instructions, nullification 
kills the instruction in the delay slot of the branch likely instruction. 

NullifyCurrentlnstruction () 

endf unction NullifyCurrentlnstruction 

Figure 2-25 NullifyCurrentlnstruction Pseudocode Function 

CoprocessorOperation 

The CoprocessorOperation function performs the specified Coprocessor operation. 

CoprocessorOperation (z, cop_fun) 

/* z: Coprocessor unit number */ 

/* cop_fun: Coprocessor function from function field of instruction */ 

/* Transmit the cop_fun value to coprocessor z */ 

endfunction CoprocessorOperation 

Figure 2-26 CoprocessorOperation Pseudocode Function 

JumpDelaySlot 

The JumpDelaySlot function is used in the pseudocode for the PC-relative instructions in the MIPS16e ASE. The 
function returns TRUE if the instruction at vAddr is executed in a jump delay slot. A jump delay slot always immediately 
follows a JR, JAL, JALR, or JALX instruction. 

JumpDelaySlot (vAddr) 
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/* vAddr : Virtual address */ 

endfunction JumpDelaySlot 

Figure 2-27 JumpDelaySlot Pseudocode Function 

FPConditionCode 

The FPConditionCode function returns the value of a specific floating point condition code. 

tf 4— FPConditionCode (cc) 

/* tf: The value of the specified condition code */ 

/* cc: The Condition code number in the range 0. .7 */ 

if cc = 0 then 

FPConditionCode 4— FCSR 2 3 
else 

FPConditionCode 4— FCSR 2 4 +cc 
endif 

endfunction FPConditionCode 

Figure 2-28 FPConditionCode Pseudocode Function 


SetFPConditionCode 

The SetFPConditionCode function writes a new value to a specific floating point condition code. 

SetFPConditionCode (cc) 
if cc = 0 then 

FCSR 4- FCSR 31 .. 24 II tf || FCSR 22 ..o 
else 

FCSR 4- FCSR 31 .. 25+cc I! tf || FCSR 23+cc .. 0 

endif 

endfunction SetFPConditionCode 

Figure 2-29 SetFPConditionCode Pseudocode Function 


2.3 Op and Function Subfield Notation 

In some instructions, the instruction subfields op and function can have constant 5- or 6-bit values. When reference is 
made to these instructions, uppercase mnemonics are used. For instance, in the floating point ADD instruction, 
pp=COPl and function = A DD . In other cases, a single field has both fixed and variable subfields, so the name contains 
both upper- and lowercase characters. 


2.4 FPU Instructions 

In the detailed description of each FPU instruction, all variable subfields in an instruction format (such as fs, ft, 
immediate, and so on) are shown in lowercase. The instruction name (such as ADD, SUB, and so on) is shown in 
uppercase. 
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For the sake of clarity, an alias is sometimes used for a variable subfield in the formats of specific instructions. For 
example, rs=base in the format for load and store instructions. Such an alias is always lowercase since it refers to a 
variable subfield. 

Bit encodings for mnemonics are given in Volume I, in the chapters describing the CPU, FPU, MDMX, and MIPS16e 
instructions. 

See Section 2.3, "Op and Function Subfield Notation" on page 20 for a description of the op and function subfields. 
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Chapter 3 

The MIPS32™ Instruction Set 


3.1 Compliance and Subsetting 

To be compliant with the MIPS32 Architecture, designs must implement a set of required features, as described in this 
document set. To allow flexibility in implementations, the MIPS32 Architecture does provide subsetting rules. An 
implementation that follows these rules is compliant with the MIPS 32 Architecture as long as it adheres strictly to the 
rules, and fully implements the remaining instructions. Supersetting of the MIPS32 Architecture is only allowed by 
adding functions to the SPECIAL2 major opcode, by adding control for co-processors via the COP2, LWC2, SWC2, 
LDC2, and/or SDC2, and/or COP3 opcodes, or via the addition of approved Application Specific Extensions. Note, 
however, that a decision to use the COP3 opcode in an implementation of the MIPS32 Architecture precludes a 
compatible upgrade to the MIPS64 Architecture because the COP 3 opcode is used as part of the floating point ISA in 
the MIPS64 Architecture. 

The instruction set subsetting rules are as follows: 

• All CPU instructions must be implemented - no subsetting is allowed. 

• The FPU and related support instructions, including the MOVF and MOVT CPU instructions, may be omitted. 
Software may determine if an FPU is implemented by checking the state of the FP bit in the Configl CPO register. If 
the FPU is implemented, it must include S, D, and W formats, operate instructions, and all supporting instructions. 
Software may determine which FPU data types are implemented by checking the appropriate bit in the FIR CPI 
register. The following allowable FPU subsets are compliant with the MIPS 32 architecture: 

- No FPU 

- FPU with S, D, and W formats and all supporting instructions 

• Coprocessor 2 is optional and may be omitted. Software may determine if Coprocessor 2 is implemented by 
checking the state of the C2 bit in the Configl CPO register. If Coprocessor 2 is implemented, the Coprocessor 2 
interface instructions (BC2, CFC2, COP2, CTC2, LDC2, LWC2, MFC2, MTC2, SDC2, and SWC2) may be omitted 
on an instruction-by-instruction basis. 

• Supervisor Mode is optional. If Supervisor Mode is not implemented, bit 3 of the Status register must be ignored on 
write and read as zero. 

• The standard TLB-based memory management unit may be replaced with a simpler MMU (e.g., a Fixed Mapping 
MMU). If this is done, the rest of the interface to the Privileged Resource Architecture must be preserved. If a 
TLB-based memory management unit is implemented, it must be the standard TLB-based MMU as described in the 
Privileged Resource Architecture chapter. Software may determine the type of the MMU by checking the MT field in 
the Config CPO register. 

• The Privileged Resource Architecture includes several implementation options and may be subsetted in accordance 
with those options. 

• Instruction, CPO Register, and CPI Control Register fields that are marked “Reserved” or shown as “0” in the 
description of that field are reserved for future use by the architecture and are not available to implementations. 
Implementations may only use those fields that are explicitly reserved for implementation dependent use. 

• Supported ASEs are optional and may be subsetted out. If most cases, software may determine if a supported ASE is 
implemented by checking the appropriate bit in the Configl or Config3 CPO register. If they are implemented, they 
must implement the entire ISA applicable to the component, or implement subsets that are approved by the ASE 
specifications. 

• EJTAG is optional and may be subsetted out. If it is implemented, it must implement only those subsets that are 
approved by the EJTAG specification. 
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• If any instruction is subsetted out based on the rules above, an attempt to execute that instruction must cause the 
appropriate exception (typically Reserved Instruction or Coprocessor Unusable). 


3.2 Alphabetical List of Instructions 

Table 3-1 through Table 3-24 provide a list of instructions grouped by category. Individual instruction descriptions 
follow the tables, arranged in alphabetical order. 


Table 3-1 CPU Arithmetic Instructions 


Mnemonic 

Instruction 


ADD 

Add Word 


ADDI 

Add Immediate Word 


ADDIU 

Add Immediate Unsigned Word 


ADDU 

Add Unsigned Word 


CLO 

Count Leading Ones in Word 


CLZ 

Count Leading Zeros in Word 


DIV 

Divide Word 


DIVU 

Divide Unsigned Word 


MADD 

Multiply and Add Word to Hi, Lo 


MADDU 

Multiply and Add Unsigned Word to Hi, Lo 


MSUB 

Multiply and Subtract Word to Hi, Lo 


MSUBU 

Multiply and Subtract Unsigned Word to Hi, Lo 


MUL 

Multiply Word to GPR 


MULT 

Multiply Word 


MULTU 

Multiply Unsigned Word 


SEB 

Sign-Extend Byte 

Release 2 Only 

SEH 

Sign-Extend Halftword 

Release 2 Only 

SLT 

Set on Less Than 


SLTI 

Set on Less Than Immediate 


SLTIU 

Set on Less Than Immediate Unsigned 


SLTU 

Set on Less Than Unsigned 


SUB 

Subtract Word 


SUBU 

Subtract Unsigned Word 



Table 3-2 CPU Branch and Jump Instructions 

Mnemonic 

Instruction 


B 

Unconditional Branch 
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3.2 Alphabetical List of Instructions 


Table 3-2 CPU Branch and Jump Instructions 


Mnemonic 

Instruction 


BAL 

Branch and Link 


BEQ 

Branch on Equal 


BGEZ 

Branch on Greater Than or Equal to Zero 


BGEZAL 

Branch on Greater Than or Equal to Zero and Link 


BGTZ 

Branch on Greater Than Zero 


BT.F.Z, 

Branch on Less Than or Equal to Zero 


BLTZ 

Branch on Less Than Zero 


BLTZAL 

Branch on Less Than Zero and Link 


BNE 

Branch on Not Equal 


J 

Jump 


JAL 

Jump and Link 


JALR 

Jump and Link Register 


JALR.HB 

Jump and Link Register with Hazard Barrier 

Release 2 Only 

JR 

Jump Register 


JR.HB 

Jump Register with Hazard Barrier 

Release 2 Only 


Table 3-3 CPU Instruction Control Instructions 


Mnemonic 

Instruction 


EHB 

Execution Hazard Barrier 

Release 2 Only 

NOP 

No Operation 


SSNOP 

Superscalar No Operation 



Table 3-4 CPU Load, Store, and Memory Control Instructions 

Mnemonic 

Instruction 


LB 

Load Byte 


LBU 

Load Byte Unsigned 


LH 

Load Halfword 


LHU 

Load Halfword Unsigned 


LL 

Load Linked Word 


LW 

Load Word 


LWL 

Load Word Left 


LWR 

Load Word Right 


PREF 

Prefetch 
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Chapter 3 The MIPS32™ Instruction Set 


Table 3-4 CPU Load, Store, and Memory Control Instructions 


Mnemonic 

Instruction 


SB 

Store Byte 


SC 

Store Conditional Word 


SD 

Store Doubleword 


SH 

Store Halfword 


SW 

Store Word 


SWL 

Store Word Left 


SWR 

Store Word Right 


SYNC 

Synchronize Shared Memory 


SYNCI 

Synchronize Caches to Make Instruction Writes Effective 

Release 2 Only 


Table 3-5 CPU Logical Instructions 


Mnemonic 

Instruction 

AND 

And 

ANDI 

And Immediate 

LUI 

Load Upper Immediate 

NOR 

Not Or 

OR 

Or 

ORI 

Or Immediate 

XOR 

Exclusive Or 

XORI 

Exclusive Or Immediate 


Table 3-6 CPU Insert/Extract Instructions 


Mnemonic 

Instruction 


EXT 

Extract Bit Field 

Release 2 Only 

INS 

Insert Bit Field 

Release 2 Only 

WSBH 

Word Swap Bytes Within Halfwords 

Release 2 Only 


Table 3-7 CPU Move Instructions 

Mnemonic 

Instruction 

MFHI 

Move From HI Register 

MFLO 

Move From LO Register 

MOVF 

Move Conditional on Floating Point False 
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3.2 Alphabetical List of Instructions 


Table 3-7 CPU Move Instructions 


Mnemonic 

Instruction 


MOVN 

Move Conditional on Not Zero 

MOVT 

Move Conditional on Floating Point True 

MOVZ 

Move Conditional on Zero 

MTHI 

Move To HI Register 

MTLO 

Move To LO Register 

RDHWR 

Read Hardware Register 

Release 2 Only 


Table 3-8 CPU Shift Instructions 


Mnemonic 

Instruction 


ROTR 

Rotate Word Right 

Release 2 Only 

ROTRV 

Rotate Word Right Variable 

Release 2 Only 

SLL 

Shift Word Left Logical 


SLLV 

Shift Word Left Logical Variable 

SRA 

Shift Word Right Arithmetic 

SRAV 

Shift Word Right Arithmetic Variable 

SRL 

Shift Word Right Logical 

SRLV 

Shift Word Right Logical Variable 


Table 3-9 CPU Trap Instructions 


Mnemonic 

Instruction 

BREAK 

Breakpoint 

SYSCALL 

System Call 

TEQ 

Trap if Equal 

TEQI 

Trap if Equal Immediate 

TGE 

Trap if Greater or Equal 

TGEI 

Trap if Greater of Equal Immediate 

TGEIU 

Trap if Greater or Equal Immediate Unsigned 

TGEU 

Trap if Greater or Equal Unsigned 

TLT 

Trap if Less Than 

TLTI 

Trap if Less Than Immediate 

TLTIU 

Trap if Less Than Immediate Unsigned 

TLTU 

Trap if Less Than Unsigned 

TNE 

Trap if Not Equal 
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Table 3-9 CPU Trap Instructions 


Mnemonic 

Instruction 

TNEI 

Trap if Not Equal Immediate 


Table 3-10 Obsolete 1 CPU Branch Instructions 


Mnemonic 

Instruction 

BEQL 

Branch on Equal Likely 

BGEZALL 

Branch on Greater Than or Equal to Zero and Link Likely 

BGEZL 

Branch on Greater Than or Equal to Zero Likely 

BGTZL 

Branch on Greater Than Zero Likely 

BLF.Z.L 

Branch on Less Than or Equal to Zero Likely 

BLTZALL 

Branch on Less Than Zero and Link Likely 

BLTZL 

Branch on Less Than Zero Likely 

BNEL 

Branch on Not Equal Likely 


1 . Software is strongly encouraged to avoid use of the Branch Likely instructions, as they will be removed from 
a future revision of the MIPS 32 architecture. 


Table 3-11 FPU Arithmetic Instructions 


Mnemonic 

Instruction 

ABS.fmt 

Floating Point Absolute Value 

ADD.fmt 

Floating Point Add 

DIV.fmt 

Floating Point Divide 

M ADD.fmt 

Floating Point Multiply Add 

MSUB.fmt 

Floating Point Multiply Subtract 

MUL.fmt 

Floating Point Multiply 

NEG.fmt 

Floating Point Negate 

NM ADD.fmt 

Floating Point Negative Multiply Add 

NMSUB.fmt 

Floating Point Negative Multiply Subtract 

RECIP.fmt 

Reciprocal Approximation 

RSQRT.fmt 

Reciprocal Square Root Approximation 

SQRT 

Floating Point Square Root 

SUB.fmt 

Floating Point Subtract 
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3.2 Alphabetical List of Instructions 


Table 3-12 FPU Branch Instructions 


Mnemonic 

Instruction 

BC1F 

Branch on FP False 

BC1T 

Branch on FP True 


Table 3-13 FPU Compare Instructions 


Mnemonic 

Instruction 

C.cond.fmt 

Floating Point Compare 


Table 3-14 FPU Convert Instructions 


Mnemonic 

Instruction 


ALNV.PS 

Floating Point Align Variable 

64-bit FPU Only 

CEIL.L.fmt 

Floating Point Ceiling Convert to Long Fixed Point 

64-bit FPU Only 

CEIL.W.fmt 

Floating Point Ceiling Convert to Word Fixed Point 


CVT.D.fmt 

Floating Point Convert to Double Floating Point 

CVT.L.fmt 

Floating Point Convert to Long Fixed Point 

64-bit FPU Only 

CVT.PS.S 

Floating Point Convert Pair to Paired Single 

64-bit FPU Only 

CVT.S.PL 

Floating Point Convert Pair Lower to Single Floating Point 

64-bit FPU Only 

CVT.S.PU 

Floating Point Convert Pair Upper to Single Floating Point 

64-bit FPU Only 

CVT.S.fmt 

Floating Point Convert to Single Floating Point 


CVT.W.fmt 

Floating Point Convert to Word Fixed Point 

FLOOR.L.fmt 

Floating Point Floor Convert to Long Fixed Point 

64-bit FPU Only 

FLOOR. W.fmt 

Floating Point Floor Convert to Word Fixed Point 


PLL.PS 

Pair Lower Lower 

64-bit FPU Only 

PLU.PS 

Pair Lower Upper 

64-bit FPU Only 

PUL.PS 

Pair Upper Lower 

64-bit FPU Only 

PUU.PS 

Pair Upper Upper 

64-bit FPU Only 

ROUND. L.fmt 

Floating Point Round to Long Fixed Point 

64-bit FPU Only 

ROUND.W.fmt 

Floating Point Round to Word Fixed Point 


TRUNC.L.fmt 

Floating Point Truncate to Long Fixed Point 

64-bit FPU Only 

TRUNC.W.fmt 

Floating Point Truncate to Word Fixed Point 
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Table 3-15 FPU Load, Store, and Memory Control Instructions 


Mnemonic 

Instruction 


LDC1 

Load Doubleword to Floating Point 

LDXC1 

Load Doubleword Indexed to Floating Point 

64-bit FPU Only 

LUXC1 

Load Doubleword Indexed Unaligned to Floating Point 

64-bit FPU Only 

LWC1 

Load Word to Floating Point 


LWXC1 

Load Word Indexed to Floating Point 

64-bit FPU Only 

PREFX 

Prefetch Indexed 


SDC1 

Store Doubleword from Floating Point 

SDXC1 

Store Doubleword Indexed from Floating Point 

64-bit FPU Only 

SUXC1 

Store Doubleword Indexed Unaligned from Floating Point 

64-bit FPU Only 

SWC1 

Store Word from Floating Point 


SWXC1 

Store Word Indexed from Floating Point 

64-bit FPU Only 


Table 3-16 FPU Move Instructions 


Mnemonic 

Instruction 


CFC1 

Move Control Word from Floating Point 

CTC1 

Move Control Word to Floating Point 

MFC1 

Move Word from Floating Point 

MFHC1 

Move Word from High Half of Floating Point Register 

Release 2 Only 

MOV.fmt 

Floating Point Move 


MOVF.fmt 

Floating Point Move Conditional on Floating Point False 

MOVN.fmt 

Floating Point Move Conditional on Not Zero 

MOVT.fmt 

Floating Point Move Conditional on Floating Point True 

MOVZ.fmt 

Floating Point Move Conditional on Zero 

MTC1 

Move Word to Floating Point 

MTHC1 

Move Word to High Half of Floating Point Register 

Release 2 Only 


Table 3-17 Obsolete 1 FPU Branch Instructions 


Mnemonic 

Instruction 

BC1FL 

Branch on FP False Likely 

BC1TL 

Branch on FP True Likely 


1. Software is strongly encouraged to avoid use of the Branch Likely instructions, as they will be removed from 
a future revision of the MIPS 32 architecture. 
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3.2 Alphabetical List of Instructions 


Table 3-18 Coprocessor Branch Instructions 


Mnemonic 

Instruction 

BC2F 

Branch on COP2 False 

BC2T 

Branch on COP2 True 


Table 3-19 Coprocessor Execute Instructions 


Mnemonic 

Instruction 

COP2 

Coprocessor Operation to Coprocessor 2 


Table 3-20 Coprocessor Load and Store Instructions 


Mnemonic 

Instruction 

LDC2 

Load Doubleword to Coprocessor 2 

LWC2 

Load Word to Coprocessor 2 

SDC2 

Store Doubleword from Coprocessor 2 

SWC2 

Store Word from Coprocessor 2 


Table 3-21 Coprocessor Move Instructions 


Mnemonic 

Instruction 


CFC2 

Move Control Word from Coprocessor 2 

CTC2 

Move Control Word to Coprocessor 2 

MFC2 

Move Word from Coprocessor 2 

MFHC2 

Move Word from High Half of Coprocessor 2 Register 

Release 2 Only 

MTC2 

Move Word to Coprocessor 2 


MTHC2 

Move Word to High Half of Coprocessor 2 Register 

Release 2 Only 


Table 3-22 Obsolete 1 Coprocessor Branch Instructions 


Mnemonic 

Instruction 

BC2FL 

Branch on COP2 False Likely 

BC2TL 

Branch on COP2 True Likely 


1. Software is strongly encouraged to avoid use of the Branch Likely instructions, as they will be removed from 
a future revision of the MIPS 32 architecture. 


Table 3-23 Privileged Instructions 


Mnemonic 

Instruction 

CACHE 

Perform Cache Operation 
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Table 3-23 Privileged Instructions 


Mnemonic 

Instruction 


DI 

Disable Interrupts 

Release 2 Only 

El 

Enable Interrupts 

Release 2 Only 

ERET 

Exception Return 


MFC0 

Move from Coprocessor 0 

MTC0 

Move to Coprocessor 0 

RDPGPR 

Read GPR from Previous Shadow Set 

Release 2 Only 

TLBP 

Probe TLB for Matching Entry 


TLBR 

Read Indexed TLB Entry 

TLBWI 

Write Indexed TLB Entry 

TLBWR 

Write Random TLB Entry 

WAIT 

Enter Standby Mode 

WRPGPR 

Write GPR to Previous Shadow Set 

Release 2 Only 


Table 3-24 EJTAG Instructions 


Mnemonic 

Instruction 

DERET 

Debug Exception Return 

SDBBP 

Software Debug Breakpoint 
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Floating Point Absolute Value 


ABS.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 


0 



ABS 


fmt 


fs 

fd 


010001 


00000 



000101 


6 5 5 5 5 6 


Format: ABS.S fd, fs 

ABS.D fd, fs 
ABS.PS fd, fs 


MIPS32 
MIPS32 
MIPS64 
MIPS32 Release 2 


Purpose: 

To compute the absolute value of an FP value 
Description: fd 4- abs(fs) 

The absolute value of the value in FPR fs is placed in FPR fd. The operand and result are values in format fint. 
ABS.PS takes the absolute value of the two values in FPR fs independently, and ORs together any generated excep- 
tions. 

Cause bits are ORed into the Flag bits if no exception is taken. 

This operation is arithmetic; a NaN operand signals invalid operation. 

Restrictions: 

The fields fs and fd must specify FPRs valid for operands of type fint. If they are not valid, the result is UNPRE- 
DICTABLE. 

The operand must be a value in format fint, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of ABS.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR(fd, fmt, AbsoluteValue (ValueFPR (fs, fmt) ) ) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation, Invalid Operation 
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ADD 


Add Word 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL 




0 

ADD 


rs 

rt 

rd 



000000 




00000 

100000 


6 5 

5 

5 

5 

6 

Format: add rd, rs, rt 




MIPS32 

Purpose: 






To add 32-bit integers. If an overflow occurs, then trap. 

Description: rd <- rs + rt 

The 32-bit word value in GPR rt is added to the 32-bit value in GPR rs to produce a 32-bit result. 

• If the addition results in 32-bit 2’s complement arithmetic overflow, the destination register is not modified and 
an Integer Overflow exception occurs. 

• If the addition does not overflow, the 32-bit result is placed into GPR rd. 

Restrictions: 

None 

Operation: 

temp <- (GPR [rs] 31 | | GPR [rs] 31 . -0 ) + (GPR [ rt ] 31 | | GPR [rt ] 31 . . 0 ) 
if temp 32 A temp 31 then 

Signal Except ion ( IntegerOverf low) 
else 

GPR[rd] 4— temp 
endif 


Exceptions: 

Integer Overflow 

Programming Notes: 

ADDU performs the same arithmetic operation but does not trap on overflow. 
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Floating Point Add 


ADD.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 





ADD 


fmt 

ft 

fs 

fd 


010001 





000000 


6 5 5 5 5 6 


Format: add.S fd, fs, ft 

ADD . D fd, fs, ft 
ADD. PS fd, fs, ft 


MIPS32 
MIPS32 
MIPS64 
MIPS32 Release 2 


Purpose: 

To add floating point values 

Description: fd <- fs + ft 

The value in FPR/f is added to the value in FPR fs. The result is calculated to infinite precision, rounded by using to 
the current rounding mode in FCSR , and placed into FPR fd. The operands and result are values in format fint. 
ADD. PS adds the upper and lower halves of FPR fs and FPR ft independently, and ORs together any generated 
exceptions. 

Cause bits are ORed into the Flag bits if no exception is taken. 

Restrictions: 

The fields fs,ft, and fd must specify FPRs valid for operands of type Jmt. If they are not valid, the result is UNPRE- 
DICTABLE. 

The operands must be values in format fmt, if they are not, the result is UNPREDICTABLE and the value of the 
operand FPRs becomes UNPREDICTABLE. 

The result of ADD. PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR (fd, fmt, ValueFPR(fs, fmt) + frat ValueFPR(ft, fmt)) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation, Invalid Operation, Inexact, Overflow, Underflow 
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ADDI 


Add Immediate Word 


31 

26 

25 

21 

20 16 

15 

0 

ADDI 

001000 



rs 

rt 

immediate 


6 



5 

5 

16 


Format: 

Purpose: 

ADDI 

rt, 

rs, immediate 


MIPS32 


To add a constant to a 32-bit integer. If overflow occurs, then trap. 

Description: rt 4— rs + immediate 

The 16-bit signed immediate is added to the 32-bit value in GPR rs to produce a 32-bit result. 

• If the addition results in 32-bit 2’s complement arithmetic overflow, the destination register is not modified and 
an Integer Overflow exception occurs. 

• If the addition does not overflow, the 32-bit result is placed into GPR rt. 

Restrictions: 

None 

Operation: 

temp <— (GPR[rs] 31 | | GPR [ rs ] 31 _ _ 0 ) + sign_extend (immediate) 
if temp 32 ^ temp 31 then 

Signal Except ion ( IntegerOverf low) 
else 

GPR[rt] temp 
endif 

Exceptions: 

Integer Overflow 

Programming Notes: 

ADDIU performs the same arithmetic operation but does not trap on overflow. 
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Add Immediate Unsigned Word 


ADDIU 


31 26 25 21 20 16 15 0 



6 5 5 16 


Format: ADDIU rt, rs, immediate MIPS32 

Purpose: 

To add a constant to a 32-bit integer 
Description: rt 4— rs + immediate 

The 16-bit signed immediate is added to the 32-bit value in GPR rs and the 32-bit arithmetic result is placed into 
GPR rt. 

No Integer Overflow exception occurs under any circumstances. 

Restrictions: 

None 

Operation: 

temp <— GPR[rs] + sign_extend (immediate) 

GPR[rt]<— temp 

Exceptions: 

None 

Programming Notes: 

The term “unsigned” in the instruction name is a misnomer; this operation is 32-bit modulo arithmetic that does not 
trap on overflow. This instruction is appropriate for unsigned arithmetic, such as address arithmetic, or integer arith- 
metic environments that ignore overflow, such as C language arithmetic. 
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ADDU 


Add Unsigned Word 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL 




0 

ADDU 


rs 

rt 

rd 



000000 




00000 

100001 


6 5 

5 

5 

5 

6 

Format: addu rd, rs, rt 

Purpose: 

To add 32-bit integers 

Description: rd 4- rs + rt 




MIPS32 


The 32-bit word value in GPR rt is added to the 32-bit value in GPR r,v and the 32-bit arithmetic result is placed into 
GPR rd. 

No Integer Overflow exception occurs under any circumstances. 

Restrictions: 

None 

Operation: 

temp <— GPR[rs] + GPR[rt] 

GPR[rd] 4— temp 

Exceptions: 

None 

Programming Notes: 

The term “unsigned” in the instruction name is a misnomer; this operation is 32-bit modulo arithmetic that does not 
trap on overflow. This instruction is appropriate for unsigned arithmetic, such as address arithmetic, or integer arith- 
metic environments that ignore overflow, such as C language arithmetic. 
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Floating Point Align Variable 


ALNV.PS 


31 26 25 21 20 16 15 11 10 6 5 0 


COP IX 





ALNV.PS 


rs 

ft 

fs 

fd 


010011 





011110 


6 5 5 5 5 6 


Format: alnv.ps fd, fs, ft, rs 


MIPS64 
MIPS32 Release 2 


Purpose: 

To align a misaligned pair of paired single values 


Description: fd ByteAlign (rs 2 0 ' fs, ft) 

FPR fs is concatenated with FPR ft and this value is funnel-shifted by GPR rs 2 g bytes, and written into FPR fd. If 
GPR r,v 2 o i s ))• fd receives fs. If GPR r,v 2 g is 4, the operation depends on the current endianness. 


Figure 3-1 illustrates the following example: for a big-endian operation and a byte alignment of 4, the upper half of fd 
receives the lower half of the paired single value in fs, and the lower half of fd receives the upper half of the paired 
single value in ft. 


Figure 3-1 Example of an ALNV.PS Operation 



The move is nonarithmetic; it causes no IEEE 754 exceptions. 
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Floating Point Align Variable (cont.) 


ALNV.PS 


Restrictions: 

The fields fs, ft, and fd must specify FPRs valid for operands of type PS. If they are not valid, the result is UNPRE- 
DICTABLE. 

If GPR r,v-| q are non-zero, the results are UNPREDICTABLE. 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

if GPR[rs] 2 ..o = 0 then 

StoreFPR ( f d, PS, ValueFPR ( f s , PS) ) 
else if GPR[rs] 2 ..o ^ 4 then 

UNPREDICTABLE 

else if BigEndianCPU then 

StoreFPR ( f d, PS, ValueFPR(fs, PS) 31--0 || ValueFPR ( ft , PS ) 63 .. 32 ) 
else 

StoreFPR ( f d, PS, ValueFPR(ft, PS) 31--0 ]| ValueFPR (fs, PS) 63 .. 32 ) 
endif 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Programming Notes: 

ALNV.PS is designed to be used with LUXC1 to load 8 bytes of data from any 4-byte boundary. For example: 

/* Copy T2 bytes (a multiple of 16) of data TO to Tl, TO unaligned, T1 aligned. 
Reads one dw beyond the end of TO. */ 

LUXC1 FO, 0 (TO) /* set up by reading 1st src dw */ 

LI T3, 0 /* index into src and dst arrays */ 

ADDIU T4, TO, 8 /* base for odd dw loads */ 

ADDIU T5, Tl, -8/* base for odd dw stores */ 

LOOP : 

LUXC1 FI, T3(T4) 

ALNV.PS F2 , FO, FI, TO/* switch FO, FI for little-endian */ 

SDC1 F2 , T3 (Tl ) 

ADDIU T3, T3, 16 

LUXC1 FO, T3 (TO ) 

ALNV.PS F2 , FI, FO, TO/* switch FI, FO for little-endian */ 

BNE T3, T2 , LOOP 

SDC1 F2 , T3(T5) 

DONE : 
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ALNV.PS 


Floating Point Align Variable (cont.) 

ALNV.PS is also useful with SUXC1 to store paired-single results in a vector loop to a possibly misaligned address: 

/* T 1 [ i ] = TO [ i ] + F8, TO aligned, T1 unaligned. */ 

CVT.PS.S F8, F8, F8/* make addend paired-single */ 

/* Loop header computes 1st pair into FO, stores high half if T1 */ 

/* misaligned */ 

LOOP : 


LDCl 

F2, 

T3 ( T4 ) / * get TO [ i+2 ] /TO [ i+3 ] */ 


ADD .PS 

FI, 

F2 , 

F8/* compute T1 [i+2] /T1 [i+3] 

*/ 

ALNV.PS 

F3, 

F0, 

FI, Tl/* align to dst memory 

*/ 

SUXC1 

F3, 

T3(T1)/* store to T1 [ i + 0 ] /T1 [ i + 1 ] 

*/ 

ADDIU 

T3, 

16 

/* i = i + 4 */ 


LDCl 

F2, 

T3 (TO ) /* get TO [i + 0] /TO [i + 1] */ 


ADD .PS 

F0, 

F2 , 

F8/* compute Tl [ i + 0 ] /Tl [ i + 1 ] 

*/ 

ALNV.PS 

F3, 

FI, 

F0, Tl/* align to dst memory 

*/ 

BNE 

T3, 

T2 , 

LOOP 


SUXC1 

F3, 

T3(T5)/* store to T1 [ i+2 ] /T1 [ i + 3 ] 

*/ 


/* Loop trailer stores all or half of FO, depending on T1 alignment */ 
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Format: and rd, rs, rt 

Purpose: 

To do a bitwise logical AND 
Description: rd 4— rs and rt 




MIPS32 


The contents of GPR rs are combined with the contents of GPR rt in a bitwise logical AND operation. The result is 
placed into GPR rd. 

Restrictions: 

None 

Operation: 

GPR [ rd] <- GPR [ rs ] and GPR[rt] 

Exceptions: 

None 
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ANDI 


And Immediate 
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Format: ANDI rt, rs, immediate MIPS32 

Purpose: 

To do a bitwise logical AND with a constant 

Description: rt 4— rs AND immediate 

The 16-bit immediate is zero-extended to the left and combined with the contents of GPR rs in a bitwise logical AND 
operation. The result is placed into GPR rt. 

Restrictions: 

None 

Operation: 

GPR[rt] 4— GPRfrs] and zero_extend (immediate) 

Exceptions: 

None 
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Unconditional Branch 
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Format: b offset Assembly Idiom 

Purpose: 

To do an unconditional branch 

Description: branch 

B offset is the assembly idiom used to denote an unconditional branch. The actual instruction is interpreted by the 
hardware as BEQ rO, rO, offset. 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

— 9 

I: target_of f set <— sign_extend (of f set MO) 

1+1 : PC 4— PC + target_of f set 

Exceptions: 

None 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 Kbytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 
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Branch and Link 


BAL 
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Format: bal rs, offset Assembly Idiom 

Purpose: 

To do an unconditional PC-relative procedure call 

Description: procedure_call 

BAL offset is the assembly idiom used to denote an unconditional branch. The actual instruction is iterpreted by the 
hardware as BGEZAL rO, offset. 

Place the return address link in GPR 31. The return link is the address of the second instruction following the branch, 
where execution continues after a procedure call. 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

GPR 3 1 must not be used for the source register rs, because such an instruction does not have the same effect when 
reexecuted. The result of executing such an instruction is UNPREDICTABLE. This restriction permits an exception 
handler to resume execution by reexecuting the branch when an exception occurs in the branch delay slot. 

Operation: 

I : 

1 + 1 : 

Exceptions: 

None 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump and link (JAL) or 
jump and link register ( JALR) instructions for procedure calls to addresses outside this range. 


p 

target_of f set 4— sign_extend (of f set I I 0 ) 

GPR [ 31 ] 4- PC + 8 

PC 4— PC + target_of f set 
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Branch on FP False 
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Format: 

BC1F 

offset (cc = 0 implied) 


MIPS32 


BC1F 

cc, offset 


MIPS32 


Purpose: 

To test an FP condition code and do a PC-relative conditional branch 

Description: if cc = 0 then branch 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself) in the branch delay slot to form a PC-relative effective target address. If the FP con- 
dition code bit CC is false (0), the program branches to the effective target address after the instruction in the delay 
slot is executed. An FP condition code is set by the FP compare instruction, C.cond.fmt. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

This operation specification is for the general Branch On Condition operation with the f/' (true/false) and nd (nullify 
delay slot) fields as variables. The individual instructions BC1F, BC1FL, BC1T, and BC1TL have specific values for 
tf and nd. 

I: condition <— FPConditionCode (cc) = 0 

target_of f set <— (of fset 15 ) GPRLEN_ <16+2) I [ offset | ! 0 2 
1 + 1 : if condition then 

PC <— PC + target_of f set 
endif 
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BC1F 


Branch on FP False (cont.) 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range 

Flistorical Information: 

The MIPS I architecture defines a single floating point condition code, implemented as the coprocessor 1 condition 
signal ( CplCond) and the C bit in the FP Control/Status register. MIPS I, II, and III architectures must have the CC 
field set to 0, which is implied by the first format in the “Format” section. 

The MIPS IV and MIPS32 architectures add seven more Condition Code bits to the original condition code 0. FP 
compare and conditional branch instructions specify the Condition Code bit to set or test. Both assembler formats are 
valid for MIPS IV and MIPS32. 

In the MIPS I, II, and III architectures there must be at least one instruction between the compare instruction that sets 
the condition code and the branch instruction that tests it. Hardware does not detect a violation of this restriction. 
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Branch on FP False Likely 
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Format: 

BC1FL 

offset (cc = 0 implied) 


MIPS32 


BC1FL 

cc, offset 


MIPS32 


Purpose: 

To test an FP condition code and make a PC-relative conditional branch; execute the instruction in the delay slot only 
if the branch is taken. 

Description: if cc = 0 then branch_likely 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself) in the branch delay slot to form a PC-relative effective target address. If the FP Con- 
dition Code bit CC is false (0), the program branches to the effective target address after the instruction in the delay 
slot is executed. If the branch is not taken, the instruction in the delay slot is not executed. 

An FP condition code is set by the FP compare instruction, C.cond.fmt. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

This operation specification is for the general Branch On Condition operation with the //'(true/false) and nd (nullify 
delay slot) fields as variables. The individual instructions BC1F, BC1FL, BC1T, and BC1TL have specific values for 
If and nd. 

I: condition FPConditionCode (cc) = 0 

target_of f set <— (of f set 15 ) GPRLEN ~ (16+2) | | offset I | 0 2 

1 + 1 : if condition then 

PC <— PC + target_of f set 
else 

Null if y Cur rent Inst ruction ( ) 
endif 
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BC1FL 


Branch on FP False Likely (cont.) 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump ( J) or jump register 
(JR) instructions to branch to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BC1F instruction instead. 

Historical Information: 

The MIPS I architecture defines a single floating point condition code, implemented as the coprocessor 1 condition 
signal (CpICond) and the C bit in the FP Control/Status register. MIPS I, II, and III architectures must have the CC 
field set to 0, which is implied by the first format in the “Format” section. 

The MIPS IV and MIPS32 architectures add seven more Condition Code bits to the original condition code 0. FP 
compare and conditional branch instructions specify the Condition Code bit to set or test. Both assembler formats are 
valid for MIPS IV and MIPS32. 

In the MIPS II andlll architectionrs there must be at least one instruction between the compare instruction that 
sets a condition code and the branch instruction that tests it. Flardware does not detect a violation of this restriction. 
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Branch on FP True 
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Format: 

BC1T offset (cc = 
BC1T cc, offset 

0 implied) 


MIPS32 

MIPS32 


Purpose: 

To test an FP condition code and do a PC-relative conditional branch 

Description: if cc = 1 then branch 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself) in the branch delay slot to form a PC-relative effective target address. If the FP con- 
dition code bit CC is true (1), the program branches to the effective target address after the instruction in the delay slot 
is executed. An FP condition code is set by the FP compare instruction, C.cond.fmt. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

This operation specification is for the general Branch On Condition operation with the f/' (true/false) and nd (nullify 
delay slot) fields as variables. The individual instructions BC1F, BC1FL, BC1T, and BC1TL have specific values for 
tf and nd. 

I: condition <— FPConditionCode (cc) = 1 

target_of f set <— (of fset 15 ) GPRLEN_ <16+2) I [ offset | ! 0 2 
1 + 1 : if condition then 

PC <— PC + target_of f set 
endif 
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BC1T 


Branch on FP True (cont.) 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 

Historical Information: 

The MIPS I architecture defines a single floating point condition code, implemented as the coprocessor 1 condition 
signal ( CplCond ) and the C bit in the FP Control/Status register. MIPS I, II, and III architectures must have the CC 
field set to 0, which is implied by the first format in the “Format” section. 

The MIPS IV and MIPS32 architectures add seven more Condition Code bits to the original condition code 0. FP 
compare and conditional branch instructions specify the Condition Code bit to set or test. Both assembler formats are 
valid for MIPS IV and MIPS32. 

In the MIPS I, II, and III architectures there must be at least one instruction between the compare instruction that sets 
the condition code and the branch instruction that tests it. Hardware does not detect a violation of this restriction. 
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Branch on FP True Likely 
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Format: 

BC1TL 

offset (cc = 0 implied) 


MIPS32 


BC1TL 

cc, offset 


MIPS32 


Purpose: 

To test an FP condition code and do a PC-relative conditional branch; execute the instruction in the delay slot only if 
the branch is taken. 

Description: if cc = 1 then branch_likely 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself) in the branch delay slot to form a PC-relative effective target address. If the FP Con- 
dition Code bit CC is true (1), the program branches to the effective target address after the instruction in the delay 
slot is executed. If the branch is not taken, the instruction in the delay slot is not executed. 

An FP condition code is set by the FP compare instruction, C.cond.fmt. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

This operation specification is for the general Branch On Condition operation with the //'(true/false) and nd (nullify 
delay slot) fields as variables. The individual instructions BC1F, BC1FL, BC1T, and BC1TL have specific values for 
tf and nd. 

I: condition FPConditionCode (cc) = 1 

target_of f set <— (of f set 15 ) GPRLEN ~ (16+2) | | offset I | 0 2 

1+1 : if condition then 

PC <— PC + target_of f set 
else 

Null if y Cur rent Inst ruction ( ) 
endif 
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BC1TL 


Branch on FP True Likely (cont.) 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump ( J) or jump register 
(JR) instructions to branch to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BC1T instruction instead. 

Historical Information: 

The MIPS I architecture defines a single floating point condition code, implemented as the coprocessor 1 condition 
signal (CpICond) and the C bit in the FP Control/Status register. MIPS I, II, and III architectures must have the CC 
field set to 0, which is implied by the first format in the “Format” section. 

The MIPS IV and MIPS32 architectures add seven more Condition Code bits to the original condition code 0. FP 
compare and conditional branch instructions specify the Condition Code bit to set or test. Both assembler formats are 
valid for MIPS IV and MIPS32. 

In the MIPS II andlll architectionrs there must be at least one instruction between the compare instruction that 
sets a condition code and the branch instruction that tests it. Hardware does not detect a violation of this restriction. 
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Format: 

BC2F 

offset (cc = 0 implied) 


MIPS32 


BC2F 

cc, offset 


MIPS32 


Purpose: 

To test a COP2 condition code and do a PC-relative conditional branch 

Description: if cc = 0 then branch 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself) in the branch delay slot to form a PC-relative effective target address. If the COP2 
condition specified by CC is false (0), the program branches to the effective target address after the instruction in the 
delay slot is executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

This operation specification is for the general Branch On Condition operation with the (/'(true/false) and nd (nullify 
delay slot) fields as variables. The individual instructions BC2F, BC2FL, BC2T, and BC2TL have specific values for 
tf and nd. 

I: condition <— COP2Condition (cc) = 0 

target_of f set (of f set 15 ) GPRLEN_ <16+2) I I offset | ! 0 2 

1 + 1 : if condition then 

PC <— PC + target_of f set 
endif 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 
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Format: 

BC2FL 

offset (cc = 0 implied) 


MIPS32 


BC2FL 

cc, offset 


MIPS32 


Purpose: 

To test a COP2 condition code and make a PC -relative conditional branch; execute the instruction in the delay slot 
only if the branch is taken. 

Description: if cc = 0 then branch_likely 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself) in the branch delay slot to form a PC-relative effective target address. If the COP2 
condition specified by CC is false (0), the program branches to the effective target address after the instruction in the 
delay slot is executed. If the branch is not taken, the instruction in the delay slot is not executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

This operation specification is for the general Branch On Condition operation with the //'(true/false) and nd (nullify 
delay slot) fields as variables. The individual instructions BC2F, BC2FL, BC2T, and BC2TL have specific values for 
tf and nd. 

I: condition COP2Condition (cc) = 0 

target_of f set <— (of fset 15 ) GPRLEN_ <16+2) | | offset | | 0 2 

1 + 1 : if condition then 

PC <— PC + target_of f set 
else 

NullifyCurrentlnstruction () 
endif 
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Branch on COP2 False Likely (cont.) 


BC2FL 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BC2F instruction instead. 
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Format: 

BC2T offset (cc = 
BC2T cc, offset 

0 implied) 


MIPS32 

MIPS32 


Purpose: 

To test a COP2 condition code and do a PC-relative conditional branch 

Description: if cc = 1 then branch 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself) in the branch delay slot to form a PC-relative effective target address. If the COP2 
condition specified by CC is true (1), the program branches to the effective target address after the instruction in the 
delay slot is executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

This operation specification is for the general Branch On Condition operation with the (/'(true/false) and nd (nullify 
delay slot) fields as variables. The individual instructions BC2F, BC2FL, BC2T, and BC2TL have specific values for 
tf and nd. 

I: condition <— COP2Condition (cc) = 1 

target_of f set (of f set 15 ) GPRLEN_ <16+2) I I offset | ! 0 2 

1 + 1 : if condition then 

PC <— PC + target_of f set 
endif 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 
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Format: 

BC2TL 

offset (cc = 0 implied) 


MIPS32 


BC2TL 

cc, offset 


MIPS32 


Purpose: 

To test a COP2 condition code and do a PC-relative conditional branch; execute the instruction in the delay slot only 
if the branch is taken. 

Description: if cc = 1 then branch_likely 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself) in the branch delay slot to form a PC-relative effective target address. If the COP2 
condition specified by CC is true (1), the program branches to the effective target address after the instruction in the 
delay slot is executed. If the branch is not taken, the instruction in the delay slot is not executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

This operation specification is for the general Branch On Condition operation with the //'(true/false) and nd (nullify 
delay slot) fields as variables. The individual instructions BC2F, BC2FL, BC2T, and BC2TL have specific values for 
tf and nd. 

I: condition COP2Condition (cc) = 1 

target_of f set <— (of fset 15 ) GPRLEN_ <16+2) I I offset | | 0 2 
1+1 : if condition then 

PC <— PC + target_of f set 
else 

NullifyCurrentlnstruction () 
endif 
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Branch on COP2 True Likely (cont.) 


BC2TL 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BC2T instruction instead. 
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Branch on Equal BEQ 


31 

26 

25 

21 

20 

16 

15 

0 

BEQ 

000100 

rs 

rt 

offset 

6 



5 


5 

16 


Format: 

BEQ 

rs. 

rt, offset 




MIPS32 


Purpose: 

To compare GPRs then do a PC -relative conditional branch 

Description: if rs = rt then branch 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs and GPR rt are equal, branch to the effective target address after the instruction in the delay 
slot is executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

— 9 

I: target_of f set <— sign_extend (of f set NO) 

condition 4— (GPR[rs] = GPR[rt]) 

1+1 : if condition then 

PC <— PC + target_of f set 
endif 


Exceptions: 

None 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 Kbytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 

BEQ rO, rO offset, expressed as B offset, is the assembly idiom used to denote an unconditional branch. 
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Branch on Equal Likely 


BEQL 


31 


26 

25 


21 

20 

16 

15 

0 

BEQL 

010100 

rs 

rt 

offset 


6 



5 



5 

16 



Format: beql rs, rt, offset MIPS32 

Purpose: 

To compare GPRs then do a PC -relative conditional branch; execute the delay slot only if the branch is taken. 

Description: if rs = rt then branch_likely 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs and GPR rt are equal, branch to the target address after the instruction in the delay slot is 
executed. If the branch is not taken, the instruction in the delay slot is not executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

— 9 

I: target_of f set <— sign_extend (of f set NO) 

condition 4— (GPR[rs] = GPR[rt]) 

1+1 : if condition then 

PC <— PC + target_of f set 
else 

NullifyCurrentlnstruction ( ) 
endif 

Exceptions: 

None 
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Branch on Equal Likely (cont.) 


BEQL 


Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BEQ instruction instead. 

Historical Information: 

In the MIPS I architecture, this instruction signaled a Reserved Instruction Exception. 
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Branch on Greater Than or Equal to Zero 


BGEZ 


31 26 25 21 20 16 15 0 


REGIMM 


BGEZ 



rs 


offset 

000001 


00001 



6 5 

5 

16 


Format: bgez rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional branch 

Description: if rs > 0 then branch 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are greater than or equal to zero (sign bit is 0), branch to the effective target address after the 
instruction in the delay slot is executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

— 9 

I: target_of f set <— sign_extend (of f set MO) 

condition <— GPRfrs] > o GPRLEN 
1+1: if condition then 

PC <— PC + target_of f set 
endif 

Exceptions: 

None 

Programming Notes: 

With the 18-bit signed instmction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 
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Branch on Greater Than or Equal to Zero and Link 


BGEZAL 


31 26 25 21 20 16 15 0 


REGIMM 


BGEZAL 



rs 


offset 

000001 


10001 
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5 

16 


Format: bgezal rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional procedure call 

Description: if rs > 0 then procedure_call 

Place the return address link in GPR 31. The return link is the address of the second instruction following the branch, 
where execution continues after a procedure call. 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are greater than or equal to zero (sign bit is 0), branch to the effective target address after the 
instruction in the delay slot is executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

GPR 3 1 must not be used for the source register rs, because such an instruction does not have the same effect when 
reexecuted. The result of executing such an instruction is UNPREDICTABLE. This restriction permits an exception 
handler to resume execution by reexecuting the branch when an exception occurs in the branch delay slot. 

Operation: 

2 

I: target_of f set <— sign_extend (of f set I I 0 ) 

condition e- GPR[rs] > 0 GPRLEN 
GPR [ 31 ] <r- PC + 8 
1+1 : if condition then 

PC <— PC + target_of f set 
endif 

Exceptions: 

None 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump and link (JAL) or 
jump and link register (JALR) instructions for procedure calls to addresses outside this range. 

BGEZAL rO, offset, expressed as BAL offset, is the assembly idiom used to denote a PC -relative branch and link. 
BAL is used in a manner similar to JAL, but provides PC-relative addressing and a more limited target PC range. 
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Branch on Greater Than or Equal to Zero and Link Likely 


BGEZALL 


31 26 25 21 20 16 15 0 


REGIMM 


BGEZALL 



rs 


offset 

000001 


10011 



6 5 

5 

16 


Eormat: bgezall rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional procedure call; execute the delay slot only if the branch is taken. 

Description: if rs > 0 then procedure_call_likely 

Place the return address link in GPR 31. The return link is the address of the second instruction following the branch, 
where execution continues after a procedure call. 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are greater than or equal to zero (sign bit is 0), branch to the effective target address after the 
instruction in the delay slot is executed. If the branch is not taken, the instruction in the delay slot is not executed. 

Restrictions: 

GPR 3 1 must not be used for the source register rs, because such an instruction does not have the same effect when 
reexecuted. The result of executing such an instruction is UNPREDICTABLE. This restriction permits an exception 
handler to resume execution by reexecuting the branch when an exception occurs in the branch delay slot. 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

p 

I: target_of f set <— sign_extend (of f set I I 0 ) 

condition e- GPR[rs] > 0 GPRLEN 
GPR [ 31 ] <r- PC + 8 
1+1 : if condition then 

PC <— PC + target_of f set 
else 

Null if y Cur rent Inst ruction ( ) 
endif 

Exceptions: 

None 
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Branch on Greater Than or Equal to Zero and Link Likely (con't.) 


BGEZALL 


Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump and link (JAL) or 
jump and link register ( JALR) instmctions for procedure calls to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BGEZAL instruction instead. 

Historical Information: 

In the MIPS I architecture, this instruction signaled a Reserved Instruction Exception. 
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Branch on Greater Than or Equal to Zero Likely 


BGEZL 


31 26 25 21 20 16 15 0 
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rs 


offset 
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16 


Format: bgezl rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional branch; execute the delay slot only if the branch is taken. 

Description: if rs > 0 then branch_likely 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are greater than or equal to zero (sign bit is 0), branch to the effective target address after the 
instruction in the delay slot is executed. If the branch is not taken, the instruction in the delay slot is not executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

— 9 

I: target_of f set <— sign_extend (of f set MO) 

condition <— GPR[rs] > o GPRLEN 
1 + 1 : if condition then 

PC <— PC + target_of f set 
else 

Null if y Cur rent Inst ruction ( ) 
endif 


Exceptions: 

None 
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Branch on Greater Than or Equal to Zero Likely (cont.) 


BGEZL 


Programming Notes: 

With the 18-bit signed instmction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BGEZ instruction instead. 

Historical Information: 

In the MIPS I architecture, this instruction signaled a Reserved Instruction Exception. 
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Branch on Greater Than Zero 


BGTZ 
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Format: bgtz rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional branch 

Description: if rs > 0 then branch 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are greater than zero (sign bit is 0 but value not zero), branch to the effective target address 
after the instruction in the delay slot is executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

— 9 

I: target_of f set <— sign_extend (of f set MO) 

condition <— GPRfrs] > o GPRLEN 
1 + 1 : if condition then 

PC <— PC + target_of f set 
endif 

Exceptions: 

None 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 
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Branch on Greater Than Zero Likely 


BGTZL 


31 26 25 21 20 16 15 0 


BGTZL 


0 



rs 


offset 

010111 


00000 



6 5 

5 

16 


Format: bgtzl rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional branch; execute the delay slot only if the branch is taken. 

Description: if rs > 0 then branch_likely 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are greater than zero (sign bit is 0 but value not zero), branch to the effective target address 
after the instruction in the delay slot is executed. If the branch is not taken, the instruction in the delay slot is not exe- 
cuted. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

p 

I: target_of f set <— sign_extend (of f set | | 0 ) 

condition 4- GPR[rs] > o GPRLEN 
1+1 : if condition then 

PC <— PC + target_of f set 
else 

Null if y Cur rent Inst ruction ( ) 
endif 


Exceptions: 

None 
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Branch on Greater Than Zero Likely (cont.) 


BGTZL 


Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BGTZ instruction instead. 

Historical Information: 

In the MIPS I architecture, this instruction signaled a Reserved Instruction Exception. 
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Branch on Less Than or Equal to Zero 


BLEZ 
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Format: blez rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional branch 

Description: if rs < 0 then branch 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are less than or equal to zero (sign bit is 1 or value is zero), branch to the effective target 
address after the instruction in the delay slot is executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

— 9 

I: target_of f set <— sign_extend (of f set MO) 

condition <— GPR[rs] < o GPRLEN 
1+1: if condition then 

PC <— PC + target_of f set 
endif 

Exceptions: 

None 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 


72 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




BLEZL 


Branch on Less Than or Equal to Zero Likely 


31 26 25 21 20 16 15 0 
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offset 
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5 

16 


Format: blezl rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional branch; execute the delay slot only if the branch is taken. 

Description: if rs < 0 then branch_likely 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are less than or equal to zero (sign bit is 1 or value is zero), branch to the effective target 
address after the instruction in the delay slot is executed. If the branch is not taken, the instruction in the delay slot is 
not executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

p 

I: target_of f set <— sign_extend (of f set | | 0 ) 

condition 4- GPR[rs] < o GPRLEN 
1 + 1 : if condition then 

PC <— PC + target_of f set 
else 

Null if y Cur rent Inst ruction ( ) 
endif 

Exceptions: 

None 
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BLEZL 


Branch on Less Than or Equal to Zero Likely (cont.) 


Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BLEZ instruction instead. 

Historical Information: 

In the MIPS I architecture, this instruction signaled a Reserved Instruction Exception. 
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BLTZ 


Branch on Less Than Zero 


31 26 25 21 20 16 15 0 


REGIMM 


BLTZ 



rs 


offset 
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00000 
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16 


Format: bltz rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional branch 

Description: if rs < 0 then branch 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are less than zero (sign bit is 1), branch to the effective target address after the instruction in 
the delay slot is executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

o 

I: target_of f set <— sign_extend (of f set I | 0 ) 

condition <— GPR[rs] < o GPRLEN 
1 + 1 : if condition then 

PC <— PC + target_of f set 
endif 

Exceptions: 

None 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump and link (JAL) or 
jump and link register (JALR) instructions for procedure calls to addresses outside this range. 
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BLTZAL 


Branch on Less Than Zero and Link 


31 26 25 21 20 16 15 0 


REGIMM 


BLTZAL 



rs 


offset 
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Format: bltzal rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional procedure call 

Description: if rs < 0 then procedure_call 

Place the return address link in GPR 31. The return link is the address of the second instruction following the branch, 
where execution continues after a procedure call. 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are less than zero (sign bit is 1), branch to the effective target address after the instruction in 
the delay slot is executed. 

Restrictions: 

GPR 3 1 must not be used for the source register rs, because such an instruction does not have the same effect when 
reexecuted. The result of executing such an instruction is UNPREDICTABLE. This restriction permits an exception 
handler to resume execution by reexecuting the branch when an exception occurs in the branch delay slot. 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

2 

I: target_of f set <— sign_extend (of f set I I 0 ) 

condition e- GPR[rs] < 0 GPRLEN 
GPR [ 31 ] <r- PC + 8 
1+1 : if condition then 

PC <— PC + target_of f set 
endif 

Exceptions: 

None 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump and link (JAL) or 
jump and link register (JALR) instructions for procedure calls to addresses outside this range. 
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BLTZALL 


Branch on Less Than Zero and Link Likely 
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Format: bltzall rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional procedure call; execute the delay slot only if the branch is taken. 

Description: if rs < 0 then procedure_call_likely 

Place the return address link in GPR 31. The return link is the address of the second instruction following the branch, 
where execution continues after a procedure call. 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are less than zero (sign bit is 1), branch to the effective target address after the instruction in 
the delay slot is executed. If the branch is not taken, the instruction in the delay slot is not executed. 

Restrictions: 

GPR 3 1 must not be used for the source register rs, because such an instruction does not have the same effect when 
reexecuted. The result of executing such an instruction is UNPREDICTABLE. This restriction permits an exception 
handler to resume execution by reexecuting the branch when an exception occurs in the branch delay slot. 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

p 

I: target_of f set <— sign_extend (of f set I I 0 ) 

condition e- GPR[rs] < 0 GPRLEN 
GPR [ 31 ] <r- PC + 8 
1+1 : if condition then 

PC <— PC + target_of f set 
else 

Null if y Cur rent Inst ruction ( ) 
endif 

Exceptions: 

None 
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BLTZALL 


Branch on Less Than Zero and Link Likely (cont.) 


Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump and link (JAL) or 
jump and link register ( JALR) instmctions for procedure calls to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BLTZAL instruction instead. 

Historical Information: 

In the MIPS I architecture, this instruction signaled a Reserved Instruction Exception. 


78 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




BLTZL 


Branch on Less Than Zero Likely 


31 26 25 21 20 16 15 0 


REGIMM 


BLTZL 



rs 


offset 
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00010 
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16 


Format: bltzl rs, offset 



MIPS32 

Purpose: 





To test a GPR then do a PC-relative conditional branch; execute the delay slot only if the branch is taken. 

Description: if rs < 0 then branch_likely 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs are less than zero (sign bit is 1), branch to the effective target address after the instruction in 
the delay slot is executed. If the branch is not taken, the instruction in the delay slot is not executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

— 9 

I: target_of f set <— sign_extend (of f set MO) 

condition <— GPR[rs] < o GPRLEN 
1 + 1 : if condition then 

PC <— PC + target_of f set 
else 

Null if y Cur rent Inst ruction ( ) 
endif 


Exceptions: 

None 
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BLTZL 


Branch on Less Than Zero Likely (cont.) 


Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BLTZ instruction instead. 

Historical Information: 

In the MIPS I architecture, this instruction signaled a Reserved Instruction Exception. 
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Branch on Not Equal 


BNE 


31 26 25 21 20 16 15 


0 


BNE 

000101 


rs 


rt 


offset 


6 


5 


5 


16 


Format: bne rs, rt, offset MIPS32 

Purpose: 

To compare GPRs then do a PC -relative conditional branch 

Description: if rs ^ rt then branch 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs and GPR rt are not equal, branch to the effective target address after the instruction in the 
delay slot is executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

— 9 

I: target_of f set <— sign_extend (of f set NO) 

condition <— (GPR[rs] ^ GPR[rt]) 

1+1 : if condition then 

PC <— PC + target_of f set 
endif 

Exceptions: 

None 

Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 
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BNEL 


Branch on Not Equal Likely 


31 

26 

25 

21 

20 

16 

15 

0 

BNEL 

010101 

rs 

rt 

offset 

6 



5 


5 

16 


Format: 

BNEL 

rs, 

rt, offset 



MIPS32 

Purpose: 









To compare GPRs then do a PC -relative conditional branch; execute the delay slot only if the branch is taken. 

Description: if rs rt then branch_likely 

An 18-bit signed offset (the 16-bit offset field shifted left 2 bits) is added to the address of the instruction following 
the branch (not the branch itself), in the branch delay slot, to form a PC-relative effective target address. 

If the contents of GPR rs and GPR rt are not equal, branch to the effective target address after the instruction in the 
delay slot is executed. If the branch is not taken, the instruction in the delay slot is not executed. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

— 9 

I: target_of f set <— sign_extend (of f set MO) 

condition <— (GPR[rs] ^ GPR[rt]) 

1+1 : if condition then 

PC <— PC + target_of f set 
else 

NullifyCurrentlnstruction ( ) 
endif 


Exceptions: 

None 
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BNEL 


Branch on Not Equal Likely (cont.) 


Programming Notes: 

With the 18-bit signed instruction offset, the conditional branch range is ± 128 KBytes. Use jump (J) or jump register 
(JR) instructions to branch to addresses outside this range. 

Software is strongly encouraged to avoid the use of the Branch Likely instructions, as they will be removed from a 
future revision of the MIPS Architecture. 

Some implementations always predict the branch will be taken, so there is a significant penalty if the branch is not 
taken. Software should only use this instruction when there is a very high probability (98% or more) that the branch 
will be taken. If the branch is not likely to be taken or if the probability of a taken branch is unknown, software is 
encouraged to use the BNE instruction instead. 

Historical Information: 

In the MIPS I architecture, this instruction signaled a Reserved Instruction Exception. 
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Breakpoint 


BREAK 

31 26 25 


6 5 0 

SPECIAL 

code 

BREAK 

000000 


001101 

6 

20 

6 

Format: break 

Purpose: 

To cause a Breakpoint exception 

Description: 


MIPS32 


A breakpoint exception occurs, immediately and unconditionally transferring control to the exception handler. The 
code field is available for use as software parameters, but is retrieved by the exception handler only by loading the 
contents of the memory word containing the instruction. 

Restrictions: 

None 

Operation: 

SignalException (Breakpoint) 

Exceptions: 

Breakpoint 
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Floating Point Compare 


C.cond.fmt 


31 26 25 21 20 16 15 11 10 8 7 6 5 4 3 0 


COP1 






A 

FC 



fmt 

ft 

fs 

CC 

0 



cond 

010001 






0 

11 



6 

5 



5 

5 

3 

1 1 2 

4 

Format: 

C.cond.S fs, 

ft 

(cc = 

0 implied) 




MIPS32 


C.cond.D fs, 

ft 

(cc = 

0 implied) 




MIPS32 


C.cond.PS fs, 

ft 

(cc = 

0 implied) 



MIPS32 

MIPS64 
Release 2 


C.cond.S cc, 

fs. 

ft 





MIPS32 


C.cond.D cc, 

fs. 

ft 





MIPS32 


C.cond.PS cc, 

. f s 

, ft 




MIPS32 

MIPS64 
Release 2 


Purpose: 

To compare FP values and record the Boolean result in a condition code 

Description: cc 4- fs compare_cond ft 

The value in FPR /,v is compared to the value in FPR /h the values are in format fmt. The comparison is exact and nei- 
ther overflows nor underflows. 

If the comparison specified by cond 2 / is true for the operand values, the result is true; otherwise, the result is false. If 
no exception is taken, the result is written into condition code CC; true is 1 and false is 0. 

c.cond.PS compares the upper and lower halves of FPR /,v and FPR/t independently and writes the results into condi- 
tion codes CC +1 and CC respectively. The CC number must be even. If the number is not even the operation of the 
instruction is UNPREDICTABLE. 

If one of the values is an SNaN, or condj is set and at least one of the values is a QNaN, an Invalid Operation condi- 
tion is raised and the Invalid Operation flag is set in the FCSR. If the Invalid Operation Enable bit is set in the FCSR , 
no result is written and an Invalid Operation exception is taken immediately. Otherwise, the Boolean result is written 
into condition code CC. 

There are four mutually exclusive ordering relations for comparing floating point values; one relation is always true 
and the others are false. The familiar relations are greater than, less than, and equal. In addition, the IEEE floating 
point standard defines the relation unordered, which is true when at least one operand value is NaN; NaN compares 
unordered with everything, including itself. Comparisons ignore the sign of zero, so +0 equals -0. 

The comparison condition is a logical predicate, or equation, of the ordering relations such as less than or equal, 
equal, not less than, or unordered or equal. Compare distinguishes among the 16 comparison predicates. The Bool- 
ean result of the instruction is obtained by substituting the Boolean value of each ordering relation for the two FP val- 
ues in the equation. If the equal relation is true, for example, then all four example predicates above yield a true 
result. If the unordered relation is true then only the final predicate, unordered or equal, yields a true result. 

Logical negation of a compare result allows eight distinct comparisons to test for the 16 predicates as shown in . Each 
mnemonic tests for both a predicate and its logical negation. For each mnemonic, compare tests the truth of the first 
predicate. When the first predicate is true, the result is true as shown in the “If Predicate Is True” column, and the sec- 
ond predicate must be false, and vice versa. (Note that the False predicate is never true and False/True do not follow 
the normal pattern.) 

The truth of the second predicate is the logical negation of the instruction result. After a compare instruction, test for 
the truth of the first predicate can be made with the Branch on FP True (BC1T) instruction and the truth of the second 
can be made with Branch on FP False (BC1F). 
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Floating Point Compare (cont.) 


C.cond.fmt 


Table 3-25 shows another set of eight compare operations, distinguished by a cond j value of 1 and testing the same 16 
conditions. For these additional comparisons, if at least one of the operands is a NaN, including Quiet NaN, then an 
Invalid Operation condition is raised. If the Invalid Operation condition is enabled in the FCSR , an Invalid Operation 
exception occurs. 


Table 3-25 FPU Comparisons Without Special Operand Exceptions 








Comparison CC 



Instruction 

Comparison Predicate 





Result 

Instruction 



Relation 


InvOp 

Condition 




Values 



Excp. 

Field 







If 

if 











Cond 

Name of Predicate and 





Predicate 

QNaN 



Mnemonic 

Logically Negated Predicate (Abbreviation) 

> 

< 

= 

? 

Is True 

9 

3 

2..0 


False [this predicate is always False] 

F 

F 

F 

F 














0 











True (T) 

T 

T 

T 

T 





UN 

Unordered 

F 

F 

F 

T 

T 



1 

Ordered (OR) 

T 

T 

T 

F 

F 





EQ 

Equal 

F 

F 

T 

F 

T 



9 

Not Equal (NEQ) 

T 

T 

F 

T 

F 





UEQ 

Unordered or Equal 

F 

F 

T 

T 

T 



3 

Ordered or Greater Than or Less Than (OGL) 

T 

T 

F 

F 

F 

No 

0 













OLT 

Ordered or Less Than 

F 

T 

F 

F 

T 



4 

Unordered or Greater Than or Equal (UGE) 

T 

F 

T 

T 

F 





ULT 

Unordered or Less Than 

F 

T 

F 

T 

T 



5 

Ordered or Greater Than or Equal (OGE) 

T 

F 

T 

F 

F 





OLE 

Ordered or Less Than or Equal 

F 

T 

T 

F 

T 



6 

Unordered or Greater Than (UGT) 

T 

F 

F 

T 

F 





ULE 

Unordered or Less Than or Equal 

F 

T 

T 

T 

T 



7 

Ordered or Greater Than (OGT) 

T 

F 

F 

F 

F 






Key: ? = unordered , > = greater than , < = less than , = is equal, T = True, F = False 
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Floating Point Compare (cont.) 


C.cond.fmt 


Table 3-26 FPU Comparisons With Special Operand Exceptions for QNaNs 


Instruction 

Comparison Predicate 

Comparison CC 
Result 

Instructio 

n 

Cond 

Mnemonic 

Name of Predicate and 
Logically Negated Predicate (Abbreviation) 

Relation 

Values 

If 

Predicate 
Is True 

Inv Op 
Excp If 
QNaN? 

Condition 

Field 

> 

< 

= 

9 

3 

2..0 

SF 

Signaling False [this predicate always False] 

F 

F 

F 

F 

F 

Yes 

1 

0 

Signaling True (ST) 

T 

T 

T 

T 

NGLE 

Not Greater Than or Less Than or Equal 

F 

F 

F 

T 

T 

1 

Greater Than or Less Than or Equal (GLE) 

T 

T 

T 

F 

F 

SEQ 

Signaling Equal 

F 

F 

T 

F 

T 

2 

Signaling Not Equal (SNE) 

T 

T 

F 

T 

F 

NGL 

Not Greater Than or Less Than 

F 

F 

T 

T 

T 

3 

Greater Than or Less Than (GL) 

T 

T 

F 

F 

F 

LT 

Less Than 

F 

T 

F 

F 

T 

4 

Not Less Than (NLT) 

T 

F 

T 

T 

F 

NGE 

Not Greater Than or Equal 

F 

T 

F 

T 

T 

5 

Greater Than or Equal (GE) 

T 

F 

T 

F 

F 

LE 

Less Than or Equal 

F 

T 

T 

F 

T 

6 

Not Less Than or Equal (NLE) 

T 

F 

F 

T 

F 

NGT 

Not Greater Than 

F 

T 

T 

T 

T 

7 

Greater Than (GT) 

T 

F 

F 

F 

F 


Key: ? = unordered , > = greater than , < = less than , = is equal , T = True, F = False 
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Floating Point Compare (cont.) 


C.cond.fmt 


Restrictions: 

The ticlcls/'s and ft must specify FPRs valid for operands of type/mf; if they are not valid, the result is UNPREDICT- 
ABLE. 

The operands must be values in format fmt; if they are not, the result is UNPREDICTABLE and the value of the 
operand FPRs becomes UNPREDICTABLE. 

The result of C.cond.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode, or if the condi- 
tion code number is odd. 

Operation: 

if SNaN (ValueFPR ( f s , fmt) ) or SNaN (ValueFPR ( ft , fmt)) or 
QNaN (ValueFPR ( f s , fmt)) or QNaN (ValueFPR ( ft , fmt)) then 
less <— false 
equal 4— false 
unordered 4- true 

if (SNaN (ValueFPR (fs, fmt) ) or SNaN (ValueFPR ( ft , fmt )) ) or 
(cond 3 and (QNaN (ValueFPR ( fs , fmt ) ) or QNaN (ValueFPR ( ft , fmt ))) ) then 
Signal Except ion ( InvalidOperat ion) 
endif 
else 

less <— ValueFPR(fs, fmt) < fmt ValueFPR(ft, fmt) 
equal <— ValueFPR (fs, fmt) =f m t ValueFPR (ft, fmt) 
unordered 4— false 
endif 

condition <— (cond 2 and less) or (condg and equal) 
or (condg and unordered) 

SetFPConditionCode (cc, condition) 

For c.cond.PS, the pseudo code above is repeated for both halves of the operand registers, treating each half as an 
independent single-precision values. Exceptions on the two halves are logically ORed and reported together. The 
results of the lower half comparison are written to condition code CC; the results of the upper half comparison are 
written to condition code CC+ 1 . 
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Floating Point Compare (cont.) 


C.cond.fmt 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation, Invalid Operation 

Programming Notes: 

FP computational instructions, including compare, that receive an operand value of Signaling NaN raise the Invalid 
Operation condition. Comparisons that raise the Invalid Operation condition for Quiet NaNs in addition to SNaNs 
permit a simpler programming model if NaNs are errors. Using these compares, programs do not need explicit code 
to check for QNaNs causing the unordered relation. Instead, they take an exception and allow the exception handling 
system to deal with the error when it occurs. For example, consider a comparison in which we want to know if two 
numbers are equal, but for which unordered would be an error. 

# comparisons using explicit tests for QNaN 

c.eq.d $f2,$f4# check for equal 
nop 

belt L2 # it is equal 

c.un.d $f2,$f4# it is not equal, 

# but might be unordered 

belt ERROR # unordered goes off to an error handler 

# not-equal-case code here 

# equal-case code here 
L2 : 

# 

# comparison using comparisons that signal QNaN 

c.seq.d $f2,$f4 # check for equal 

nop 

belt L2 # it is equal 

nop 

# it is not unordered here 

# not-equal-case code here 

# equal-case code here 
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CACHE 


Perform Cache Operation 


31 26 25 21 20 16 15 0 


CACHE 

101111 

base 

op 

offset 

6 

5 

5 

16 

Format: cache 

op, offset (base) 

MIPS32 

Purpose: 





To perform the cache operation specified by op. 

Description: 

The 16-bit offset is sign-extended and added to the contents of the base register to form an effective address. The 
effective address is used in one of the following ways based on the operation to be performed and the type of cache as 
described in the following table. 


Table 3-27 Usage of Effective Address 


Operation 
Requires an 

Type of 
Cache 

Usage of Effective Address 

Address 

Virtual 

The effective address is used to address the cache. It is implementation dependent 
whether an address translation is performed on the effective address (with the 
possibility that a TLB Refill or TLB Invalid exception might occur) 

Address 

Physical 

The effective address is translated by the MMU to a physical address. The physical 
address is then used to address the cache 



The effective address is translated by the MMU to a physical address. It is 
implementation dependent whether the effective address or the translated physical 
address is used to index the cache. 

Index 

N/A 

Assuming that the total cache size in bytes is CS, the associativity is A, and the 
number of bytes per tag is BPT. the following calculations give the fields of the 
address which specify the way and the index: 

OffsetBit Log2 (BPT) 

IndexBit <— Log2 (CS / A) 

WayBit <— IndexBit + Ceiling (Log2 (A) ) 

. . IndexBit 

Index <— Addr IndexBit _ 1 _ .OffsetBit 

For a direct-mapped cache, the Way calculation is ignored and the Index value 
fully specifies the cache tag. This is shown symbolically in the figure below. 
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Perform Cache Operation 


CACHE 


Figure 3-2 Usage of Address Fields to Select Index and Way 


- WayBit >■ — IndexBit 




OffsetBit 


Unused 

Way 

Index 

byte index 


A TLB Refill and TLB Invalid (both with cause code equal TLBL) exception can occur on any operation. For index 
operations (where the address is used to index the cache but need not match the cache tag) software should use 
unmapped addresses to avoid TLB exceptions. This instruction never causes TLB Modified exceptions nor TLB 
Refill exceptions with a cause code of TLBS. 

The effective address may be an arbitrarily-aligned by address. The CACHE instruction never causes an Address 
Error Exception due to an non-aligned address. 

A Cache Error exception may occur as a byproduct of some operations performed by this instruction. For example, if 
a Writeback operation detects a cache or bus error during the processing of the operation, that error is reported via a 
Cache Error exception. Similarly, a Bus Error Exception may occur if a bus operation invoked by this instruction is 
terminated in an error. However, cache error exceptions should must be triggered by an Index Load Tag or Index 
Store tag operation, as these operations are used for initialization and diagnostic purposes. 

An Address Error Exception (with cause code equal AdEL) may occur if the effective address references a portion of 
the kernel address space which would normally result in such an exception. It is implementation dependent whether 
such an exception does occur. 

It is implementation dependent whether a data watch is triggered by a cache instruction whose address matches the 
Watch register address match conditions. 

Bits [17:16] of the instruction specify the cache on which to perform the operation, as follows: 

Table 3-28 Encoding of Bits[17:16] of CACHE Instruction 


Code 

Name 

Cache 

2#00 

I 

Primary Instruction 

2#01 

D 

Primary Data or Unified Primary 

2#10 

T 

Tertiary 

2#1 1 

S 

Secondary 


Bits [20:18] of the instruction specify the operation to perform. To provide software with a consistent base of cache 
operations, certain encodings must be supported on all processors. The remaining encodings are recommended. 
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Perform Cache Operation 


CACHE 


Table 3-29 Encoding of Bits [20:18] of the CACHE Instruction 


Code 

Caches 

Name 

Effective 

Address 

Operand 

Type 

Operation 

Compliance 


I 

Index Invalidate 

Index 

Set the state of the cache block at the specified 
index to invalid. 

This required encoding may be used by 
software to invalidate the entire instruction 
cache by stepping through all valid indices. 

Required 

2#000 

D 

Index Writeback 
Invalidate / Index 
Invalidate 

Index 

For a write-back cache: If the state of the cache 
block at the specified index is valid and dirty, 
write the block back to the memory address 
specified by the cache tag. After that operation 
is completed, set the state of the cache block to 
invalid. If the block is valid but not dirty, set the 
state of the block to invalid. 

Required 







S, T 

Index Writeback 
Invalidate / Index 
Invalidate 

Index 

For a write-through cache: Set the state of the 
cache block at the specified index to invalid. 

This required encoding may be used by 
software to invalidate the entire data cache by 
stepping through all valid indices. Note that 
Index Store Tag should be used to initialize the 
cache at powerup. 

Optional 

2#001 

All 

Index Load Tag 

Index 

Read the tag for the cache block at the specified 
index into the TagLo and TagHi Coprocessor 0 
registers. If the DataLo and DataHi registers 
are implemented, also read the data 
corresponding to the byte index into the 
DataLo and DataHi registers. This operation 
must not cause a Cache Error Exception. 

The granularity and alignment of the data read 
into the DataLo and DataHi registers is 
implementation-dependent, but is typically the 
result of an aligned access to the cache, 
ignoring the appropriate low-order bits of the 
byte index. 

Recommended 
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Table 3-29 Encoding of Bits [20:18] of the CACHE Instruction 


Code 

Caches 

Name 

Effective 

Address 

Operand 

Type 

Operation 

Compliance 

2#010 

All 

Index Store Tag 

Index 

Write the tag for the cache block at the 
specified index from the TagLo and TagHi 
Coprocessor 0 registers. This operation must 
not cause a Cache Error Exception. 

This required encoding may be used by 
software to initialize the entire instruction or 
data caches by stepping through all valid 
indices. Doing so requires that the TagLo and 
TagHi registers associated with the cache be 
initialized first. 

Required 

2#01 1 

All 

Implementation 

Dependent 

Unspecified 

Available for implementation-dependent 
operation. 

Optional 

2# 100 

I, D 

Hit Invalidate 

Address 

If the cache block contains the specified 
address, set the state of the cache block to 
invalid. 

This required encoding may be used by 
software to invalidate a range of addresses 
from the instruction cache by stepping through 
the address range by the line size of the cache. 

Required 

(Instruction Cache 
Encoding Only), 
Recommended 
otherwise 

S, T 

Hit Invalidate 

Address 

Optional 
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Table 3-29 Encoding of Bits [20:18] of the CACHE Instruction 


Code 

Caches 

Name 

Effective 

Address 

Operand 

Type 

Operation 

Compliance 

2#101 

I 

Fill 

Address 

Fill the cache from the specified address. 

Recommended 

D 

Hit Writeback 
Invalidate / Hit 
Invalidate 

Address 

For a write-back cache: If the cache block 
contains the specified address and it is valid 
and dirty, write the contents back to memory. 
After that operation is completed, set the state 
of the cache block to invalid. If the block is 
valid but not dirty, set the state of the block to 
invalid. 

For a write-through cache: If the cache block 
contains the specified address, set the state of 
the cache block to invalid. 

This required encoding may be used by 
software to invalidate a range of addresses 
from the data cache by stepping through the 
address range by the line size of the cache. 

Required 

S, T 

Hit Writeback 
Invalidate / Hit 
Invalidate 

Address 

Optional 

2#1 10 

D 

Hit Writeback 

Address 

If the cache block contains the specified 
address and it is valid and dirty, write the 
contents back to memory. After the operation is 
completed, leave the state of the line valid, but 
clear the dirty state. For a write-through cache, 
this operation may be treated as a nop. 

Recommended 

S, T 

Hit Writeback 

Address 

Optional 
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Table 3-29 Encoding of Bits [20:18] of the CACHE Instruction 


Code 

Caches 

Name 

Effective 

Address 

Operand 

Type 

Operation 

Compliance 

2#1 1 1 

I, D 

Fetch and Lock 

Address 

If the cache does not contain the specified 
address, fill it from memory, performing a 
writeback if required, and set the state to valid 
and locked. If the cache already contains the 
specified address, set the state to locked. In 
set-associative or fully-associative caches, the 
way selected on a fill from memory is 
implementation dependent. 

The lock state may be cleared by executing an 
Index Invalidate, Index Writeback Invalidate, 
Hit Invalidate, or Hit Writeback Invalidate 
operation to the locked line, or via an Index 
Store Tag operation to the line that clears the 
lock bit. Note that clearing the lock state via 
Index Store Tag is dependent on the 
implementation-dependent cache tag and cache 
line organization, and that Index and Index 
Writeback Invalidate operations are dependent 
on cache line organization. Only Hit and Hit 
Writeback Invalidate operations are generally 
portable across implementations. 

It is implementation dependent whether a 
locked line is displaced as the result of an 
external invalidate or intervention that hits on 
the locked line. Software must not depend on 
the locked line remaining in the cache if an 
external invalidate or intervention would 
invalidate the line if it were not locked. 

It is implementation dependent whether a Fetch 
and Lock operation affects more than one line. 
For example, more than one line around the 
referenced address may be fetched and locked. 
It is recommended that only the single line 
containing the referenced address be affected. 

Recommended 
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Perform Cache Operation (cont.) 


CACHE 


Restrictions: 

The operation of this instruction is UNDEFINED for any operation/cache combination that is not implemented. 

The operation of this instruction is UNDEFINED if the operaation requires an address, and that address is uncache- 
able. 

The operation of the instruction is UNPREDICTABLE if the cache line that contains the CACHE instruction is the 
target of an invalidate or a writeback invalidate. 

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled. 

Operation: 

vAddr <r- GPRfbase] + sign_extend (off set) 

(pAddr, uncached) <— AddressTranslation (vAddr , DataReadReference) 

CacheOp(op, vAddr, pAddr) 

Exceptions: 

TLB Refill Exception. 

TLB Invalid Exception 
Coprocessor Unusable Exception 
Address Error Exception 
Cache Error Exception 
Bus Error Exception 
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Fixed Point Ceiling Convert to Long Fixed Point 


CEIL.L.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 


0 



CEIL.L 


fmt 


fs 

fd 


010001 


00000 



001010 


6 5 5 5 5 6 


Format: ceil.l.s 

fd, 

f s 

MIPS64 
MIPS32 Release 2 

CEIL . L . D 

fd, 

f s 

MIPS64 
MIPS32 Release 2 


Purpose: 

To convert an FP value to 64-bit fixed point, rounding up 

Description: fd 4— convert_and_round (fs) 

The value in FPR/s, in format fmt, is converted to a value in 64-bit long fixed point format and rounding toward +°° 
(rounding mode 2). The result is placed in FPR fd. 

When the source value is Infinity, NaN, or rounds to an integer outside the range -2 63 to 2 63 -l, the result cannot be 
represented correctly, an IEEE Invalid Operation condition exists, a d the Invalid Operation flag is set in the FCSR. If 
the Invalid Operation Enable bit is set in the FCSR , no result is written to fd and an Invalid Operation exception is 
taken immediately. Otherwise, the default result, 2 63 -l, is written to fd. 

Restrictions: 

The fields/? and fd must specify valid FPRs;/,v for type fmt and fd for long fixed point; if they are not valid, the result 

is UNPREDICTABLE. 

The operand must be a value in format fmt, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR(fd, L, ConvertFmt (ValueFPR ( f s , fmt), fmt, L) ) 
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Fixed Point Ceiling Convert to Long Fixed Point (cont.) 


CEIL.L.fmt 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Unimplemented Operation, Inexact, Overflow 
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Floating Point Ceiling Convert to Word Fixed Point 


CEIL.W.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 


0 



CEIL.W 


fmt 


fs 

fd 


010001 


00000 



001110 


6 

5 


5 

5 

5 

6 

Format: ceil.w.s 

fd, 

f s 




MIPS32 

CEIL . W . D 

fd, 

f s 




MIPS32 


Purpose: 

To convert an FP value to 32-bit fixed point, rounding up 

Description: fd <— convert_and_round (fs) 

The value in FPR/s, in format fmt, is converted to a value in 32-bit word fixed point format and rounding toward +°° 
(rounding mode 2). The result is placed in FPR fd. 

o i o 1 

When the source value is Infinity, NaN, or rounds to an integer outside the range -2 to 2 -1, the result cannot be 
represented correctly, an IEEE Invalid Operation condition exists, and the Invalid Operation flag is set in the FCSR. If 
the Invalid Operation Enable bit is set in the FCSR , no result is written to fd and an Invalid Operation exception is 
taken immediately. Otherwise, the default result, 2 -1, is written to fd. 

Restrictions: 

The fields/s and fd must specify valid FPRs;/,v for type fmt and fd for word fixed point; if they are not valid, the result 

is UNPREDICTABLE. 

The operand must be a value in format fmt, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

Operation: 

StoreFPR(fd, W, ConvertFmt (ValueFPR ( f s , fmt), fmt, W) ) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Unimplemented Operation, Inexact, Overflow 
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Move Control Word From Floating Point 


CFC1 


31 26 25 21 20 16 15 11 10 0 


COP1 

CF 



0 



rt 

fs 


010001 

00010 



000 0000 0000 


6 5 

5 

5 

11 


Format: CFC1 rt, fs 




MIPS32 

Purpose: 






To copy a word from an FPU control register to a GPR 

Description: rt <- FP_Control [fs] 

Copy the 32-bit word from FP (coprocessor 1) control register fs into GPR rt. 

Restrictions: 

There are a few control registers defined for the floating point unit. The result is UNPREDICTABLE if fs specifies a 
register that does not exist. 

Operation: 

if fs = 0 then 


temp <— 

FIR 



elseif fs = 

= 25 

then 


temp <— 

0 24 

M FCSR 31 .. 25 | 

1 fcsr 23 

elseif fs = 

= 26 

then 


temp <— 

0 14 

1 1 FCSR 17..12 1 

o 5 | FCSRg. .2 | | 0 

elseif fs = 

= 28 

then 


temp <— 

O 20 

1 1 FCSR 11-7 ! 1 

0 4 I 1 FCSR 24 1 1 FCSR 

elseif fs = 

= 31 

then 



temp <- FCSR 
else 

temp <- UNPREDICTABLE 

endif 

GPR[rt] <r- temp 
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Move Control Word From Floating Point (cont.) 


CFC1 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Historical Information: 

For the MIPS I, II and III architectures, the contents of GPR rt are UNPREDICTABLE for the instruction immedi- 
ately following CFC1. 

MIPS V and MIPS32 introduced the three control registers that access portions of FCSR. These registers were not 
available in MIPS I, II, III, or IV. 
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Move Control Word From Coprocessor 2 


CFC2 


31 26 25 21 20 16 15 11 10 0 


COP2 

CF 





rt 

Impl 

010010 

00010 




6 5 5 16 

Format: CFC2 rt, rd MIPS32 

The syntax shown above is an example using CFC1 as a model. The specific syntax is implementation dependent. 

Purpose: 

To copy a word from a Coprocessor 2 control register to a GPR 

Description: rt <- CP2CCR[impl] 

Copy the 32-bit word from the Coprocessor 2 control register denoted by the Impl field. The interpretation of the Imp] 
field is left entirely to the Coprocessor 2 implementation and is not specified by the architecture. 

Restrictions: 

The result is UNPREDICTABLE if Imp] specifies a register that does not exist. 

Operation: 

temp <— CP2CCR [ Impl ] 

GPR[rt] 4— temp 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 
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Count Leading Ones in Word 


CLO 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL2 




0 

CLO 


rs 

rt 

rd 



011100 




00000 

100001 


6 5 

5 

5 

5 

6 

Format: CLO rd, rs 




MIPS32 

Purpose: 






To Count the number of leading ones in a word 

Description: rd count_leading_ones rs 

Bits 31..0 of GPR rs are scanned from most significant to least significant bit. The number of leading ones is counted 
and the result is written to GPR rd. If all of bits 31. .0 were set in GPR rs. the result written to GPR rd is 32. 

Restrictions: 

To be compliant with the MIPS32 and MIPS64 Architecture, software must place the same GPR number in both the 
rt and rd fields of the instruction. The operation of the instruction is UNPREDICTABLE if the rt and rd fields of the 
instruction contain different values. 

Operation: 

temp <— 32 

for i in 31 . . 0 

if GPRlrslj^ = 0 then 
temp <— 31 - i 
break 
endif 
endf or 

GPR[rd] <r- temp 

Exceptions: 

None 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


103 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




Count Leading Zeros in Word 


CLZ 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL2 




0 

CLZ 


rs 

rt 

rd 



011100 




00000 

100000 


6 5 

5 

5 

5 

6 

Format: CLZ rd, rs 




MIPS32 

Purpose 






Count the number of leading zeros in a word 

Description: rd <r- count_leading_zeros rs 

Bits 3 1 . .0 of GPR rs are scanned from most significant to least significant bit. The number of leading zeros is counted 
and the result is written to GPR rd. If no bits were set in GPR rs, the result written to GPR rt is 32. 

Restrictions: 

To be compliant with the MIPS32 and MIPS64 Architecture, software must place the same GPR number in both the 
rt and rd fields of the instruction. The operation of the instruction is UNPREDICTABLE if the rt and rd fields of the 
instruction contain different values. 

Operation: 

temp <— 32 

for i in 31 . . 0 

if GPRlrslj^ = 1 then 
temp <— 31 - i 
break 
endif 
endf or 

GPR[rd] <r- temp 

Exceptions: 

None 
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Coprocessor Operation to Coprocessor 2 


COP2 



Format: COP 2 func MIPS32 

Purpose: 

To performance an operation to Coprocessor 2 

Description: CoprocessorOperation (2, cofun) 

An implementation-dependent operation is performance to Coprocessor 2, with the cofun value passed as an argu- 
ment. The operation may specify and reference internal coprocessor registers, and may change the state of the copro- 
cessor conditions, but does not modify state within the processor. Details of coprocessor operation and internal state 
are described in the documentation for each Coprocessor 2 implementation. 

Restrictions: 

Operation: 

CoprocessorOperation (2, cofun) 

Exceptions: 

Coprocessor Unusable 
Reserved Instruction 
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Move Control Word to Floating Point 


CTC1 


31 26 25 21 20 16 15 11 10 0 


COP1 

CT 



0 



rt 

fs 


010001 

00110 



000 0000 0000 


6 

5 

5 

5 

11 


Format: CTC1 

rt , f s 




MIPS32 

Purpose: 







To copy a word from a GPR to an FPU control register 

Description: FP_Control [fs] 4- rt 

Copy the low word from GPR rt into the FP (coprocessor 1) control register indicated by fs. 

Writing to the floating point Control/Status register, the FCSR , causes the appropriate exception if any Cause bit and 
its corresponding Enable bit are both set. The register is written before the exception occurs. Writing to FEXR to set a 
cause bit whose enable bit is already set, or writing to FENR to set an enable bit whose cause bit is already set causes 
the appropriate exception. The register is written before the exception occurs. 

Restrictions: 

There are a few control registers defined for the floating point unit. The result is UNPREDICTABLE if fs specifies a 
register that does not exist. 
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Move Control Word to Floating Point (cont.) 


CTC1 


Operation: 

temp <— GPR[rt] 31-- Q 
if fs = 25 then 

2 4 

if temp 31--8 ^ 0 then 

UNPREDICTABLE 

else 

FCSR <— temp 7> x | | FCSR 24 I 1 temp 0 | | FCSR 2 2 ..o 
endif 

elseif fs = 26 then 

if temp 22 ..i8 ^ 0 then 

UNPREDICTABLE 

else 

FCSR <- FCSR 31--18 || temp 17 .. 12 | FCSR U .. 7 II 

t emp e . . 2 II FCSR i..o 
endif 

elseif fs = 28 then 

if temp 22 ..i8 ^ 0 then 

UNPREDICTABLE 

else 

FCSR <- FCSR 31--25 II temp 2 || FCSR 23 .. 12 I I temp 11--7 
I | FCSR 6--2 | | tem Pl _ _ 0 
endif 

elseif fs = 31 then 

if temp 22 ..i8 ^ 0 then 

UNPREDICTABLE 

else 

FCSR <- temp 
endif 
else 

UNPREDICTABLE 

endif 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation, Invalid Operation, Division-by-zero, Inexact, Overflow, Underflow 

Historical Information: 

For the MIPS I, II and III architectures, the contents of floating point control register fs are undefined for the instruc- 
tion immediately following CTC 1 . 

MIPS V and MIPS32 introduced the three control registers that access portions of FCSR. These registers were not 
available in MIPS I, II, III, or IV. 
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Move Control Word to Coprocessor 2 CTC2 


31 26 25 21 20 16 15 11 10 0 


COP2 

CT 





rt 

Impl 

010010 

00110 




6 5 5 16 

Format: CTC2 rt, rd MIPS32 

The syntax shown above is an example using CTC1 as a model. The specific syntax is implementation dependent. 

Purpose: 

To copy a word from a GPR to a Coprocessor 2 control register 

Description: CP2CCR[impl] <- rt 

Copy the low word from GPR rt into the Coprocessor 2 control register denoted by the Impl field. The interpretation 
of the Impl field is left entirely to the Coprocessor 2 implementation and is not specified by the architecture. 

Restrictions: 

The result is UNPREDICTABLE if rd specifies a register that does not exist. 


Operation: 

temp <— GPR[rt] 

CP2CCR [ Impl ] <— temp 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 
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Floating Point Convert to Double Floating Point 


CVT.D.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 


0 



CVT.D 


fmt 


fs 

fd 


010001 


00000 



100001 


6 5 5 5 5 6 


Format: CVT . D . S 

fd. 

f s 

MIPS32 

CVT . D . W 

fd, 

f s 

MIPS32 

CVT . D . L 

fd. 

f s 

MIPS64 


MIPS32 Release 2 


Purpose: 

To convert an FP or fixed point value to double FP 

Description: fd convert_and_round (fs) 

The value in FPR/s, in format fmt, is converted to a value in double floating point format and rounded according to 
the current rounding mode in FCSR. The result is placed in FPR fd. If fmt is S or W, then the operation is always 
exact. 

Restrictions: 

The fields fs and fd must specify valid FPRs — fs for type fmt and fd for double floating point — if they are not valid, 
the result is UNPREDICTABLE. 

The operand must be a value in format fmt; if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

For CVT.D.L, the result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers 
mode. 

Operation: 

StoreFPR (fd, D, ConvertFmt (ValueFPR ( fs, fmt), fmt, D)) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Unimplemented Operation, Inexact 
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Floating Point Convert to Long Fixed Point 


CVT.L.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 


0 



CVT.L 


fmt 


fs 

fd 


010001 


00000 



100101 


6 5 5 5 5 6 


Format: CVT.L. S fd, fs 

MIPS64 


MIPS32 Release 2 

CVT.L. D fd, fs 

MIPS64 


MIPS32 Release 2 


Purpose: 

To convert an FP value to a 64-bit fixed point 

Description: fd 4— convert_and_round (fs) 

Convert the value in format fmt in FPR fs to long fixed point format and round according to the current rounding 
mode in FCSR. The result is placed in FPR /d. 

When the source value is Infinity, NaN, or rounds to an integer outside the range -2 63 to 2 63 -l, the result cannot be 
represented correctly, an IEEE Invalid Operation condition exists, and the Invalid Operation flag is set in the FCSR. 
If the Invalid Operation Enable bit is set in the FCSR , no result is written to fd and an Invalid Operation exception is 
taken immediately. Otherwise, the default result, 2 63 -l, is written to fd. 

Restrictions: 

The fields fs and fd must specify valid FPRs — fs for type fmt and fd for long fixed point — if they are not valid, the 
result is UNPREDICTABLE. 

The operand must be a value in format fmt, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR (fd, L, ConvertFmt (ValueFPR ( fs, fmt), fmt, L) ) 
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Floating Point Convert to Long Fixed Point, cont. 


CVT.L.fmt 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Unimplemented Operation, Inexact, Overflow 
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Floating Point Convert Pair to Paired Single 


CVT.PS.S 


31 


26 25 


21 20 


16 15 


11 10 


6 5 


COP1 

010001 


fmt 

10000 


fs 


fd 


CVT.PS 

100110 


Format: CVT.PS.S fd, fs, ft MIPS64 

MIPS32 Release 2 

Purpose: 

To convert two FP values to a paired single value 
Description: fd <- fs 31 0 | | ft 31 0 

The single-precision values in FPR fs and/f are written into FPR fd as a paired-single value. The value in FPR fs is 
written into the upper half, and the value in FPR/f is written into the lower half. 


fs ft 


1 


31 

0 

31 

0 

1 

' fd 

1 

r 


i 

63 32 31 0 


CVT.PS.S is similar to PLL.PS, except that it expects operands of format S instead of PS. 

The move is non-arithmetic; it causes no IEEE 754 exceptions. 

Restrictions: 

The fields fs and ft must specify FPRs valid for operands of type .S’; if they are not valid, the result is UNPREDICT- 
ABLE. 

The operand must be a value in format .S; if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 
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Floating Point Convert Pair to Paired Single (cont.) 


CVT.PS.S 


Operation: 

StoreFPR ( fd, S, ValueFPR (f s, S) II ValueFPR (ft , S) ) 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Unimplemented Operation 
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Floating Point Convert to Single Floating Point 


CVT.S.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 


0 



CVT.S 


fmt 


fs 

fd 


010001 


00000 



100000 


6 5 5 5 5 6 


Format: CVT . S . D 

fd. 

f S 

MIPS32 

CVT . S . W 

fd, 

f s 

MIPS32 

CVT . S . L 

fd. 

f s 

MIPS64 


MIPS32 Release 2 


Purpose: 

To convert an FP or fixed point value to single FP 

Description: fd <— convert_and_round (fs) 

The value in FPR fs, in format fmt, is converted to a value in single floating point format and rounded according to the 
current rounding mode in FCSR. The result is placed in FPR fd. 

Restrictions: 

The fields fs and fd must specify valid FPRs — fs for type fint and fd for single floating point. If they are not valid, the 
result is UNPREDICTABLE. 

The operand must be a value in format fmt; if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

For CVT.S.L, the result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers 
mode. 

Operation: 

StoreFPR(fd, S, ConvertFmt (ValueFPR ( f s , fmt), fmt, S)) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Unimplemented Operation, Inexact, Overflow, Underflow 
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Floating Point Convert Pair Lower to Single Floating Point 


CVT.S.PL 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 

fmt 

0 



CVT.S.PL 




fs 

fd 


010001 

10110 

00000 



101000 


6 5 

5 

5 

5 

6 

Format: CVT.S.PL fd, fs 




MIPS64 





MIPS32 Release 2 

Purpose: 






To convert one half of a paired single FP value to single FP 

Description: fd convert_and_round (fs) 

The lower paired single value in FPR fs, in format PS, is converted to a value in single floating point format and 
rounded according to the current rounding mode in FCSR. The result is placed in \ VR fd. This instruction can be used 
to isolate the lower half of a paired single value. 

Restrictions: 

The fields fs and fd must specify valid FPRs — fs for type PS and fd for single floating point. If they are not valid, the 
result is UNPREDICTABLE. 

The operand must be a value in format PS\ if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of CVT.S.PL is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR (fd, S, ConvertFmt (ValueFPR ( fs, PS), PL, S) ) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Unimplemented Operation, Inexact, Overflow, Underflow 
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Floating Point Convert Pair Upper to Single Floating Point 


CVT.S.PU 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 

fmt 

0 



CVT.S.PU 




fs 

fd 


010001 

10110 

00000 



100000 


6 5 

5 

5 

5 

6 

Format: CVT.S.PU fd, fs 




MIPS64 





MIPS32 Release 2 

Purpose: 






To convert one half of a paired single FP value to single FP 

Description: fd 4— convert_and_round (fs) 

The upper paired single value in FPR fs, in format PS, is converted to a value in single floating point format and 
rounded according to the current rounding mode in FCSR. The result is placed in FPR /i/. This instruction can be used 
to isolate the upper half of a paired single value. 

Restrictions: 

The fields fs and fd must specify valid FPRs — fs for type PS and fd for single floating point. If they are not valid, the 
result is UNPREDICTABLE. 

The operand must be a value in format PS\ if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of CVT.S.PU is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR (fd, S, ConvertFmt (ValueFPR ( fs, PS), PU, S)) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Unimplemented Operation, Inexact, Overflow, Underflow 
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Floating Point Convert to Word Fixed Point 


CVT.W.fmt 


31 


26 25 


21 20 16 15 


11 10 


6 5 

0 


COP1 


fmt 

0 

fs 


fd 

CVT.W 



010001 



00000 




100100 



6 


5 

5 

5 


5 

6 



Format: cvt.w.s fd, fs 

cvt.w.d fd, fs 

Purpose: 

To convert an FP value to 32-bit fixed point 

Description: fd convert_and_round (fs) 

The value in FPR fs, in format fmt, is converted to a value in 32-bit word fixed point format and rounded according to 
the current rounding mode in FCSR. The result is placed in FPR /d. 

0101 

When the source value is Infinity, NaN, or rounds to an integer outside the range -2 to 2 -1, the result cannot be 
represented correctly, an IEEE Invalid Operation condition exists, and the Invalid Operation flag is set in the FCSR. If 
the Invalid Operation Enable bit is set in the FCSR , no result is written to fd and an Invalid Operation exception is 
taken immediately. Otherwise, the default result, 2 -1, is written to fd. 

Restrictions: 

The fields fs and fd must specify valid FPRs — fs for type fmt and fd for word fixed point — if they are not valid, the 
result is UNPREDICTABLE. 

The operand must be a value in format fmt, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

Operation: 

StoreFPR(fd, W, ConvertFmt (ValueFPR ( f s , fmt), fmt, W) ) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Unimplemented Operation, Inexact, Overflow 


MIPS32 

MIPS32 
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Debug Exception Return 




DERET 
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CO 
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000 0000 0000 0000 0000 



011111 

6 


1 


19 



6 


Format: deret ejtag 

Purpose: 

To Return from a debug exception. 

Description: 

DERET clears execution and instruction hazards, returns from Debug Mode and resumes non-debug execution at the 
instruction whose address is contained in the DEPC register. DERET does not execute the next instruction (i.e. it has 
no delay slot). 

Restrictions: 

A DERET placed between an LL and SC instruction does not cause the SC to fail. 

If the DEPC register with the return address for the DERET was modified by an MTCO or a DMTCO instruction, a 
CPO hazard exists that must be removed via software insertion of the appropriate number of SSNOP instructions (for 
implementations of Release 1 of the Architecture) or by an EHB, or other execution hazard clearing instruction (for 
implementations of Release 2 of the Architecture). 

DERET implements a software barrier that resolves all execution and instruction hazards created by Coprocessor 0 
state changes (for Release 2 implementations, refer to the SYNCI instruction for additional information on resolving 
instruction hazards created by writing the instruction stream). The effects of this barrier are seen starting with the 
instruction fetch and decode of the instruction at the PC to which the DERET returns. 

This instruction is legal only if the processor is executing in Debug Mode. The operation of the processor is UNDE- 
FINED if a DERET is executed in the delay slot of a branch or jump instruction. 
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Debug Exception Return (cont.) 


DERET 


Operation: 

Debug DM <— 0 
Debug IEXI 0 

if IsMIPS16Implemented ( ) then 
PC <- DEPC 31 . .1 II 0 
ISAMode <- DEPC 0 
else 

PC <- DEPC 
endif 

ClearHazards ( ) 


Exceptions: 

Coprocessor Unusable Exception 
Reserved Instruction Exception 
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Disable Interrupts 


DI 
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Format: di 

DI rt 


MIPS32 Release 2 
MIPS32 Release 2 


Purpose: 

To return the previous value of the Status register and disable interrupts. If DI is specified without an argument, GPR 
rO is implied, which discards the previous value of the Status register. 

Description: rt Status; Status IE 0 

The current value of the Status register is loaded into general register rt. The Interrupt Enable (IE) bit in the Status 
register is then cleared. 

Restrictions: 

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled. 

In implementations prior to Release 2 of the architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

Operation: 

This operation specification is for the general interrupt enable/disable operation, with the sc field as a variable. The 
individual instructions DI and El have a specific value for the sc field. 

data <— Status 
GPR[rt] <r- data 
Status IE <— 0 
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Disable Interrupts, cont. 


DI 


Exceptions: 

Coprocessor Unusable 

Reserved Instruction (Release 1 implementations) 


Programming Notes: 

The effects of this instruction are identical to those accomplished by the sequence of reading Status into a GPR, clear- 
ing the IE bit, and writing the result back to Status. Unlike the multiple instruction sequence, however, the DI instruc- 
tion can not be aborted in the middle by an interrupt or exception. 

This instruction creates an execution hazard between the change to the Status register and the point where the change 
to the interrupt enable takes effect. This hazard is cleared by the EHB, IALR.HB, IR.HB, or ERET instructions. Soft- 
ware must not assume that a fixed latency will clear the execution hazard. 
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Divide Word 


DIV 


31 26 25 21 20 16 15 6 5 0 


SPECIAL 



0 

DIV 


rs 

rt 



000000 



00 0000 0000 

011010 


655 10 6 


Format: DIV rs, rt MIPS32 

Purpose: 

To divide a 32-bit signed integers 
Description: (hi, lo) 4- rs / rt 

The 32-bit word value in GPR rs is divided by the 32-bit value in GPR rt, treating both operands as signed values. 
The 32-bit quotient is placed into special register LO and the 32-bit remainder isplaced into special register HI. 

No arithmetic exception occurs under any circumstances. 

Restrictions: 

If the divisor in GPR rt is zero, the arithmetic result value is UNPREDICTABLE. 

Operation: 


q 

LO 

<- GPR[rs] 31 . 

<- q 

.0 div 

GPR | 

: rt ] 31 . . o 

r 

GPR[rs] 31- 

_ 0 mod 

GPR | 

: rt ] 3 i..o 


HI <— r 

Exceptions: 

None 
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Divide Word (cont.) 


DIV 


Programming Notes: 

No arithmetic exception occurs under any circumstances. If divide -by-zero or overflow conditions are detected and 
some action taken, then the divide instruction is typically followed by additional instructions to check for a zero divi- 
sor and/or for overflow. If the divide is asynchronous then the zero-divisor check can execute in parallel with the 
divide. The action taken on either divide -by-zero or overflow is either a convention within the program itself, or more 
typically within the system software; one possibility is to take a BREAK exception with a code field value to signal 
the problem to the system software. 

As an example, the C programming language in a UNIX® environment expects division by zero to either terminate 
the program or execute a program-specified signal handler. C does not expect overflow to cause any exceptional con- 
dition. If the C compiler uses a divide instruction, it also emits code to test for a zero divisor and execute a BREAK 
instruction to inform the operating system if a zero is detected. 

In some processors the integer divide operation may proceed asynchronously and allow other CPU instructions to 
execute before it is complete. An attempt to read LO or HI before the results are written interlocks until the results are 
ready. Asynchronous execution does not affect the program result, but offers an opportunity for performance 
improvement by scheduling the divide so that other instructions can execute in parallel. 

Historical Perspective: 

In MIPS 1 through MIPS III, if either of the two instructions preceding the divide is an MFHI or MFLO, the result of 
the MFHI or MFFO is UNPREDICTABFE. Reads of the HI or LO special register must be separated from subse- 
quent instructions that write to them by two or more instructions. This restriction was removed in MIPS IV and 
MIPS32 and all subsequent levels of the architecture. 
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DIV.fmt 


Floating Point Divide 
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6 

Format: 

DIV. S fd, 

fs, 

ft 




MIPS32 


DIV. D fd, 

fs, 

ft 




MIPS32 


Purpose: 

To divide FP values 

Description: fd <- fs / ft 

The value in FPR /,v is divided by the value in FPR/f. The result is calculated to infinite precision, rounded according 
to the current rounding mode in FCSR , and placed into FPR /f/. The operands and result are values in format fmt. 

Restrictions: 

The fields/s,//, and fd must specify FPRs valid for operands of type fmt; if they are not valid, the result is UNPRED- 
ICABLE. 

The operands must be values in format fmt; if they are not, the result is UNPREDICTABLE and the value of the 
operand FPRs becomes UNPREDICTABLE. 

Operation: 

StoreFPR (fd, fmt, ValueFPR(fs, fmt) / ValueFPR(ft, fmt)) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Inexact, Invalid Operation, Unimplemented Operation, Division-by-zero, Overflow, Underflow 
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Divide Unsigned Word 

31 26 25 21 20 


16 15 


6 5 

DIVU 

0 
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0 


DIVU 

rs 

rt 
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00 0000 0000 


011011 

6 5 

5 


10 


6 

Format: divu rs, rt 





MIPS32 

Purpose: 






To divide a 32-bit unsigned integers 







Description: (Hi, lo) <— rs / rt 

The 32-bit word value in GPR rs is divided by the 32-bit value in GPR rt, treating both operands as unsigned values. 
The 32-bit quotient is placed into special register LO and the 32-bit remainder is placed into special register HI. 

No arithmetic exception occurs under any circumstances. 

Restrictions: 

If the divisor in GPR rt is zero, the arithmetic result value is UNPREDICTABLE. 

Operation: 

q <— (0 | | GPR[rs] 31 . -0 ) div (° I I GPR 1 rt 1 31 . . 0 > 

r <-(0 1| GPRfrs] 31 . , 0 ) mod (0 || GPR [ rt ] 31 . . 0 ) 

LO <— sign_extend (q 31 _ 0 ) 

HI <— sign_extend ( r 31 __ 0 ) 

Exceptions: 

None 

Programming Notes: 

See “Programming Notes” for the DIV instruction. 

Historical Perspective: 

In MIPS 1 through MIPS III, if either of the two instructions preceding the divide is an MFHI or MFLO, the result of 
the MFHI or MFLO is UNPREDICTABLE. Reads of the HI or LO special register must be separated from subse- 
quent instructions that write to them by two or more instructions. This restriction was removed in MIPS IV and 
MIPS32 and all subsequent levels of the architecture. 
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Execution Hazard Barrier 


EHB 
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Format: ehb MIPS32 Release 2 

Purpose: 

To stop instruction execution until all execution hazards have been cleared. 

Description: 

EHB is the assembly idiom used to denote execution hazard barrier. The actual instruction is interpreted by the hard- 
ware as SLL rO, rO, 3. 

This instruction alters the instruction issue behavior on a pipelined processor by stopping execution until all execu- 
tion hazards have been cleared. Other than those that might be created as a consequence of setting Statuses there 
are no execution hazards visible to an unprivileged program running in User Mode. All execution hazards created by 
previous instructions are cleared for instructions executed immediately following the EHB, even if the EHB is exe- 
cuted in the delay slot of a branch or jump. The EHB instruction does not clear instruction hazards - such hazards are 
cleared by the JALR.HB, JR.HB, and ERET instructions. 

Restrictions: 

None 

Operation: 

ClearExecutionHazards () 

Exceptions: 

None 

Programming Notes: 

In MIPS32 Release 2 implementations, this instruction resolves all execution hazards. On a superscalar processor, 
EHB has alters the instruction issue behavior in a manner identical to SSNOP. For backward compatibility with 
Release 1 implementations, the last of a sequence of SSNOPs can be replaced by an EHB. In Release 1 implementa- 
tions, the EHB will be treated as an SSNOP, thereby preserving the semantics of the sequence. In Release 2 imple- 
mentations, replacing the final SSNOP with an EHB should have no performance effect because a properly sized 
sequence of SSNOPs will have already cleared the hazard. As EHB becomes the standard in MIPS implementations, 
the previous SSNOPs can be removed, leaving only the EHB. 
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Enable Interrupts 
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Format: El 

EI rt 


MIPS32 Release 2 
MIPS32 Release 2 


Purpose: 

To return the previous value of the Status register and enable interrupts. If EI is specified without an argument, GPR 
rO is implied, which discards the previous value of the Status register. 

Description: rt Status; Status IE 1 

The current value of the Status register is loaded into general register rt. The Interrupt Enable (IE) bit in the Status 
register is then set. 

Restrictions: 

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled. 

In implementations prior to Release 2 of the architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

Operation: 

This operation specification is for the general interrupt enable/disable operation, with the sc field as a variable. The 
individual instructions DI and EI have a specific value for the sc field. 

data <— Status 
GPR[rt] <r- data 
Status IE <— 1 
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Enable Interrupts, cont. 


El 


Exceptions: 

Coprocessor Unusable 

Reserved Instruction (Release 1 implementations) 


Programming Notes: 

The effects of this instruction are identical to those accomplished by the sequence of reading Status into a GPR, set- 
ting the IE bit, and writing the result back to Status. Unlike the multiple instruction sequence, however, the El 
instruction can not be aborted in the middle by an interrupt or exception. 

This instruction creates an execution hazard between the change to the Status register and the point where the change 
to the interrupt enable takes effect. This hazard is cleared by the EHB, JALR.HB, IR.HB, or ERET instructions. Soft- 
ware must not assume that a fixed latency will clear the execution hazard. 
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Exception Return 


ERET 
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Format: eret MIPS32 

Purpose: 

To return from interrupt, exception, or error trap. 

Description: 

ERET clears execution and instruction hazards, conditionally restores SRSCtl css from SRSCtl PSS in a Release 2 
implementation, and returns to the interrupted instruction at the completion of interrupt, exception, or error process- 
ing. ERET does not execute the next instruction (i.e., it has no delay slot). 

Restrictions: 

The operation of the processor is UNDEFINED if an ERET is executed in the delay slot of a branch or jump instruc- 
tion. 

An ERET placed between an LL and SC instruction will always cause the SC to fail. 

ERET implements a software barrier that resolves all execution and instruction hazards created by Coprocessor 0 
state changes (for Release 2 implementations, refer to the SYNCI instruction for additional information on resolving 
instruction hazards created by writing the instruction stream). The effects of this barrier are seen starting with the 
instruction fetch and decode of the instruction at the PC to which the ERET returns. 

In a Release 2 implementation, ERET does not restore SRSCtl C s S from SRSCtlpgs if Status BEV = 1, or if Status ERE = 
1 because any exception that sets Status ERE to 1 (Reset, Soft Reset, NMI, or cache error) does not save SRSCtl css in 
SRSCtlp S s. If software sets Status ERE to 1, it must be aware of the operation of an ERET that may be subsequently 
executed. 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


129 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




Exception Return 


ERET 


Operation: 

if Status ERL = 1 then 
temp <— ErrorEPC 
Status ERL <— 0 
else 

temp <— EPC 
Status EXL <— 0 

if (ArchitectureRevision > 2) and (SRSCtl HS3 > 0) and (Status BEV = 0)then 
SRSCtl css <- SRSCtlp SS 
endif 
endif 

if IsMIPS16Implemented ( ) then 
PC <— temp 31 , i || 0 
ISAMode <— temp 0 
else 

P C <— t emp 
endif 
LLbit <— 0 
ClearHazards ( ) 


Exceptions: 

Coprocessor Unusable Exception 
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Extract Bit Field 


EXT 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL3 

011111 

rs 


rt 

msbd 

(size-1) 

lsb 

(pos) 
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6 

Format: ext 

Purpose: 

5 

rt, rs, pos, size 

5 

5 

5 

6 

MIPS32 Release 2 


To extract a bit field from GPR rs and store it right-justified into GPR rt. 

Description: rt <— ExtractField (rs, msbd, lsb) 

The bit field starting at bit pos and extending for size bits is extracted from GPR rs and stored zero-extended and 
right-justified in GPR rt. The assembly language arguments pos and size are converted by the assembler to the 
instruction fields msbd (the most significant bit of the destination field in GPR rt), in instruction bits 15.. 11, and lsb 
(least significant bit of the source field in GPR rs), in instruction bits 10. .6, as follows: 

msbd <— size-1 
lsb <— pos 

The values of pos and size must satisfy all of the following relations: 

0 < pos < 32 
0 < size < 32 
0 < pos+size < 32 

Figure 3-3 shows the symbolic operation of the instruction. 


pos+size pos+size- 1 pos pos-1 

lsb+msbd+1 lsb+msbd lsb lsb-1 



Figure 3-3 Operation of the EXT Instruction 


Restrictions: 

In implementations prior to Release of the architecture, this instruction resulted in a Reserved Instruction Exception. 
The operation is UNPREDICTABLE if lsb+msbd >31. 
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Extract Bit Field, cont. 


EXT 


Operation: 

if (lsb + msbd) > 31) then 

UNPREDICTABLE 

endif 

temp <- 0 32 " <msbd+1) || GPR[rs] msbd+lsb . , lsb 
GPR[rt] <— temp 


Exceptions: 

Reserved Instruction 
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Floating Point Floor Convert to Long Fixed Point 


FLOOR.L.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 


0 



FLOOR.L 


fmt 


fs 

fd 


010001 


00000 



001011 


6 

5 

5 

5 

5 

6 

Format: 

FLOOR. L.S fd, fs 
FLOOR. L.D fd, fs 




MIPS64 
MIPS32 Release 2 
MIPS64 
MIPS32 Release 2 


Purpose: 

To convert an FP value to 64-bit fixed point, rounding down 

Description: fd convert_and_round (fs) 

The value in FPR fs, in format fmt, is converted to a value in 64-bit long fixed point format and rounded toward -°° 
(rounding mode 3). The result is placed in FPR/uf. 

When the source value is Infinity, NaN, or rounds to an integer outside the range -2 63 to 2 63 -l, the result cannot be 
represented correctly, an IEEE Invalid Operation condition exists, and the Invalid Operation flag is set in the FCSR. If 
the Invalid Operation Enable bit is set in the FCSR , no result is written to fd and an Invalid Operation exception is 
taken immediately. Otherwise, the default result, 2 63 -l, is written to fd. 

Restrictions: 

The fields fs and fd must specify valid FPRs — fs for type fmt and fd for long fixed point — if they are not valid, the 
result is UNPREDICTABLE. 

The operand must be a value in format fmt, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR(fd, L, ConvertFmt (ValueFPR ( f s , fmt), fmt, L) ) 
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Floating Point Floor Convert to Long Fixed Point (cont.) 


FLOOR.L.fmt 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Unimplemented Operation, Inexact, Overflow 
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Floating Point Floor Convert to Word Fixed Point 


FLOOR.W.fmt 
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Format: 

FLOOR.W. S 

fd, 

f s 




MIPS32 


FLOOR. W.D 

fd, 

f s 




MIPS32 


Purpose: 

To convert an FP value to 32-bit fixed point, rounding down 

Description: fd convert_and_round (fs) 

The value in FPR fs, in format fmt, is converted to a value in 32-bit word fixed point format and rounded toward -°° 
(rounding mode 3). The result is placed in FPR fd. 

o i o 1 

When the source value is Infinity, NaN, or rounds to an integer outside the range -2 to 2 -1, the result cannot be 
represented correctly, an IEEE Invalid Operation condition exists, and the Invalid Operation flag is set in the FCSR. If 
the Invalid Operation Enable bit is set in the FCSR , no result is written to fd and an Invalid Operation exception is 
taken immediately. Otherwise, the default result, 2 -1, is written to fd. 

Restrictions: 

The fields fs and fd must specify valid FPRs — fs for type fmt and fd for word fixed point — if they are not valid, the 
result is UNPREDICTABLE. 

The operand must be a value in format fmt, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

Operation: 

StoreFPR(fd, W, ConvertFmt (ValueFPR ( f s , fmt), fmt, W) ) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Unimplemented Operation, Inexact, Overflow 
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Insert Bit Field 
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Format: ins rt, rs, pos, size 


MIPS32 Release 2 


Purpose: 

To merge a right-justified bit field from GPR rs into a specified field in GPR rt. 


Description: rt <— InsertField (rt, rs, msb, lsb) 

The right-most size bits from GPR rs are merged into the value from GPR rt starting at bit position pos. The result 
isplaced back in GPR rt. The assembly language arguments pos and size are converted by the assembler to the 
instruction fields msb (the most significant bit of the field), in instruction bits 15. .11, and lsb (least significant bit of 
the field), in instruction bits 10.. 6, as follows: 

msb <r- pos + size-1 
lsb <r- pos 

The values of pos and size must satisfy all of the following relations: 

0 < pos < 32 
0 < size < 32 
0 < pos+size < 32 

Figure 3-4 shows the symbolic operation of the instruction. 


31 


size size-1 
msb-lsb+1 msb-lsb 


GPR rs 


GPR rt 
Initial 
Value 


GPR rt 
Final Value 



32-(pos+size) size pos 

32-(msb+l) msb-lsb+1 lsb 


Figure 3-4 Operation of the INS Instruction 
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Insert Bit Field, cont. 


INS 


Restrictions: 

In implementations prior to Release 2 of the architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

The operation is UNPREDICTABLE if Isb > msb. 


Operation: 

if lsb > msb) then 

UNPREDICTABLE 

endif 

GPR [ rt ] 4- GPR[rt] 31 .. rasb+1 || GPR [ rs ] msb _ lsb . . „ || GPR [rt] i s b-l . . 0 

Exceptions: 

Reserved Instruction 
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Format: J target MIPS32 

Purpose: 

To branch within the current 256 MB-aligned region 

Description: 

This is a PC-region branch (not PC -relative); the effective target address is in the “current” 256 MB-aligned region. 
The low 28 bits of the target address is the instr_index field shifted left 2 bits. The remaining upper bits are the corre- 
sponding bits of the address of the instruction in the delay slot (not the branch itself). 

Jump to the effective target address. Execute the instruction that follows the jump, in the branch delay slot, before 
executing the jump itself. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

I : 

1+1 :PC <- PCqprleh-!. .28 11 instr_index || 0 2 

Exceptions: 

None 

Programming Notes: 

Forming the branch target address by catenating PC and index bits rather than adding a signed offset to the PC is an 
advantage if all program code addresses fit into a 256 MB region aligned on a 256 MB boundary. It allows a branch 
from anywhere in the region to anywhere in the region, an action not allowed by a signed relative offset. 

This definition creates the following boundary case: When the jump instruction is in the last word of a 256 MB 
region, it can branch only to the following 256 MB region containing the branch delay slot. 
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26 


Format: JAL target MIPS32 

Purpose: 

To execute a procedure call within the current 256 MB-aligned region 

Description: 

Place the return address link in GPR 31. The return link is the address of the second instruction following the branch, 
at which location execution continues after a procedure call. 

This is a PC-region branch (not PC -relative); the effective target address is in the “current” 256 MB-aligned region. 
The low 28 bits of the target address is the instr_index field shifted left 2 bits. The remaining upper bits are the corre- 
sponding bits of the address of the instruction in the delay slot (not the branch itself). 

Jump to the effective target address. Execute the instruction that follows the jump, in the branch delay slot, before 
executing the jump itself. 

Restrictions: 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

I: GPR [ 31 ] <— PC + 8 

I + 1:PC <r~ PC G p RLEN _;i_ 2s ! I instr_index | | 0 2 

Exceptions: 

None 

Programming Notes: 

Forming the branch target address by catenating PC and index bits rather 
advantage if all program code addresses fit into a 256 MB region aligned 
from anywhere in the region to anywhere in the region, an action not allowed by a signed relative offset. 

This definition creates the following boundary case: When the branch instruction is in the last word of a 256 MB 
region, it can branch only to the following 256 MB region containing the branch delay slot. 


than adding a signed offset to the PC is an 
on a 256 MB boundary. It allows a branch 
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Jump and Link Register 


JALR 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL 


0 



JALR 


rs 


rd 

hint 


000000 


00000 



001001 


6 5 5 

5 

5 

6 

Format: JALR rs (rd = 31 implied) 

JALR rd, rs 



MIPS32 

MIPS32 


Purpose: 

To execute a procedure call to an instruction address in a register 

Description: rd return_addr, PC rs 

Place the return address link in GPR rd. The return link is the address of the second instruction following the branch, 
where execution continues after a procedure call. 

For processors that do not implement the MIPS16e ASE: 

• Jump to the effective target address in GPR rs. Execute the instruction that follows the jump, in the branch delay 
slot, before executing the jump itself. 

For processors that do implement the MIPS16e ASE: 

• Jump to the effective target address in GPR rs. Execute the instruction that follows the jump, in the branch delay 
slot, before executing the jump itself. Set the ISA Mode bit to the value in GPR rs bit 0. Bit 0 of the target 
address is always zero so that no Address Exceptions occur when bit 0 of the source register is one 

In release 1 of the architecture, the only defined hint field value is 0, which sets default handling of JALR. In Release 
2 of the architecture, bit 10 of the hint field is used to encode a hazard barrier. See the JALR.HB instruction descrip- 
tion for additional information. 

Restrictions: 

Register specifiers rs and rd must not be equal, because such an instruction does not have the same effect when reex- 
ecuted. The result of executing such an instruction is UNPREDICTABLE. This restriction permits an exception han- 
dler to resume execution by re-executing the branch when an exception occurs in the branch delay slot. 

The effective target address in GPR rs must be naturally-aligned. For processors that do not implement the MIPS16e 
ASE, if either of the two least-significant bits are not zero, an Address Error exception occurs when the branch target 
is subsequently fetched as an instruction. For processors that do implement the MIPS16e ASE, if bit 0 is zero and bit 
1 is one, an Address Error exception occurs when the jump target is subsequently fetched as an instruction. 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 
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Jump and Link Register, cont. 


JALR 


Operation: 

I: temp <— GPR[rs] 

GPR [ rd] <r- PC + 8 
I+l:if Configl ca = 0 then 
PC <— t emp 
else 

PC <— tempQpj^Lgj,.]^ _ .I | ] 0 

ISAMode <r- tempg 
endif 


Exceptions: 

None 

Programming Notes: 

This is the only branch-and-link instruction that can select a register for the return link; all other link instructions use 
GPR 31. The default register for GPR rd, if omitted in the assembly language instruction, is GPR 31. 
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Jump and Link Register with Hazard Barrier 


JALR.HB 


31 26 25 21 20 16 15 11 10 9 6 5 0 


SPECIAL 

000000 

rs 

0 

00000 

rd 

1 

Any other legal 
hint value 

JALR 

001001 

6 

5 

5 

5 

1 

4 

6 


Format: JALR.HB rs (rd = 31 implied) 

JALR.HB rd, rs 


MIPS32 Release 2 
MIPS32 Release 2 


Purpose: 

To execute a procedure call to an instruction address in a register and clear all execution and instruction hazards 

Description: rd return_addr, PC <— rs, clear execution and instruction hazards 

Place the return address link in GPR rd. The return link is the address of the second instruction following the branch, 
where execution continues after a procedure call. 

For processors that do not implement the MIPS 16 ASE: 

• Jump to the effective target address in GPR rs. Execute the instruction that follows the jump, in the branch delay 
slot, before executing the jump itself. 

For processors that do implement the MIPS 16 ASE: 

• Jump to the effective target address in GPR rs. Execute the instruction that follows the jump, in the branch delay 
slot, before executing the jump itself. Set the ISA Mode bit to the value in GPR rs bit 0. Bit 0 of the target 
address is always zero so that no Address Exceptions occur when bit 0 of the source register is one 

JALR.HB implements a software barrier that resolves all execution and instruction hazards created by Coprocessor 0 
state changes (for Release 2 implementations, refer to the SYNCI instruction for additional information on resolving 
instruction hazards created by writing the instruction stream). The effects of this barrier are seen starting with the 
instruction fetch and decode of the instruction at the PC to which the JALR.HB instruction jumps. An equivalent bar- 
rier is also implemented by the ERET instruction, but that instruction is only available if access to Coprocessor 0 is 
enabled, whereas JALR.HB is legal in all operating modes. 

This instruction clears both execution and instruction hazards. Refer to the EHB instruction description for the 
method of clearing execution hazards alone. 

JALR.HB uses bit 10 of the instruction (the upper bit of the hint field) to denote the hazard barrier operation. 

Restrictions: 

Register specifiers rs and rd must not be equal, because such an instruction does not have the same effect when reex- 
ecuted. The result of executing such an instruction is UNPREDICTABLE. This restriction permits an exception han- 
dler to resume execution by re-executing the branch when an exception occurs in the branch delay slot. 

The effective target address in GPR rs must be naturally-aligned. For processors that do not implement the MIPS 16 
ASE, if either of the two least-significant bits are not zero, an Address Error exception occurs when the branch target 
is subsequently fetched as an instruction. For processors that do implement the MIPS 16 ASE, if bit 0 is zero and bit 1 
is one, an Address Error exception occurs when the jump target is subsequently fetched as an instruction. 
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Jump and Link Register with Hazard Barrier, cont. 


JALR.HB 


Restrictions, cont.: 

After modifying an instruction stream mapping or writing to the instruction stream, execution of those instructions 
has UNPREDICTABLE behavior until the instruction hazard has been cleared with JALR.HB, JR.HB, ERET, or 
DERET. Further, the operation is UNPREDICTABLE if the mapping of the current instruction stream is modified. 

JALR.HB does not clear hazards created by any instruction that is executed in the delay slot of the JALR.HB. Only 
hazards created by instructions executed before the JALR.HB are cleared by the JALR.HB. 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

I: temp <— GPR[rs] 

GPR [ rd] <— PC + 8 
1+1 : if Configl CA = 0 then 
PC <— t emp 
else 

PC <— temp GPRLEN _ 1 _ ^ i ] 0 
ISAMode <r— tempg 
endif 

ClearHazards ( ) 

Exceptions: 

None 

Programming Notes: 

JALR and JALR.HB are the only branch-and-link instructions that can select 
link instructions use GPR 3 1 . The default register for GPR rd, if omitted in 
GPR 31. 

This instruction implements the final step in clearing execution and instruction hazards before execution continues. A 
hazard is created when a Coprocessor 0 or TLB write affects execution or the mapping of the instruction stream, or 
after a write to the instruction stream. When such a situation exists, software must explicitly indicate to hardware that 
the hazard should be cleared. Execution hazards alone can be cleared with the EHB instruction. Instruction hazards 
can only be cleared with a JR.HB, JALR.HB, or ERET instruction. These instructions cause hardware to clear the 
hazard before the instruction at the target of the jump is fetched. Note that because these instructions are encoded as 
jumps, the process of clearing an instruction hazard can often be included as part of a call (JALR) or return (JR) 
sequence, by simply replacing the original instructions with the HB equivalent. 


a register for the return link; all other 
the assembly language instruction, is 
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Jump and Link Register with Hazard Barrier, cont. 


JALR.HB 


Example: Clearing hazards due to an ASID change 

/* 

* Code used to modify ASID and call a routine with the new 

* mapping established. 

•k 

* aO = New ASID to establish 

* al = Address of the routine to call 


mf c0 

vO, 

C0_EntryHi 

/* 

Read current ASID */ 

li 

vl, 

~M_EntryHiASID 

/* 

Get negative mask for field */ 

and 

vO, 

vO , vl 

/* 

Clear out current ASID value */ 

or 

vO, 

vO , aO 

/* 

OR in new ASID value */ 

mtcO 

vO, 

C0_EntryHi 

/* 

Rewrite EntryHi with new ASID */ 

jalr . 

. hb al 


/* 

Call routine, clearing the hazard 


nop 
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Jump Register 


JR 


31 26 25 21 20 11 10 6 5 0 


SPECIAL 


0 


JR 


rs 


hint 


000000 


00 0000 0000 


001000 


6 

5 

10 

5 

6 

Format: JR rs 




MIPS32 

Purpose: 






To execute a branch to an instruction address in a register 

Description: pc 4— rs 

Jump to the effective target address in GPR rs. Execute the instruction following the jump, in the branch delay slot, 
before jumping. 

For processors that implement the MIPS16e ASE, set the ISA Mode bit to the value in GPR rs bit 0. Bit 0 of the target 
address is always zero so that no Address Exceptions occur when bit 0 of the source register is one 

Restrictions: 

The effective target address in GPR rs must be naturally-aligned. For processors that do not implement the MIPS16e 
ASE, if either of the two least-significant bits are not zero, an Address Error exception occurs when the branch target 
is subsequently fetched as an instruction. For processors that do implement the MIPS16e ASE, if bit 0 is zero and bit 
1 is one, an Address Error exception occurs when the jump target is subsequently fetched as an instruction. 

In release 1 of the architecture, the only defined hint field value is 0, which sets default handling of JR. In Release 2 
of the architecture, bit 10 of the hint field is used to encode an instruction hazard barrier. See the JR.HB instruction 
description for additional information. 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 

Operation: 

I: temp 4— GPR[rs] 

I+l:if Configl CA = 0 then 
PC 4— t emp 
else 

PC 4— temp GPRLE{ j_ 1 _ i | | 0 
ISAMode 4— tempg 
endif 

Exceptions: 

None 
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Jump Register, cont. 


JR 


Programming Notes: 

Software should use the value 3 1 for the rs field of the instruction word on return from a JAL, JALR, or BGEZAL, 
and should use a value other than 3 1 for remaining uses of JR. 
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Jump Register with Hazard Barrier 


JR.HB 


31 26 25 21 20 11 10 9 6 5 0 


SPECIAL 

000000 

rs 

0 

00 0000 0000 

1 

Any other legal 
hint value 

JR 

001000 

6 

5 

10 

1 

4 

6 


Format: jr.hb rs MIPS32 Release 2 

Purpose: 

To execute a branch to an instruction address in a register and clear all execution and instruction hazards. 

Description: PC 4— rs, clear execution and instruction hazards 

Jump to the effective target address in GPR rs. Execute the instruction following the jump, in the branch delay slot, 
before jumping. 

JR.HB implements a software barrier that resolves all execution and instruction hazards created by Coprocessor 0 
state changes (for Release 2 implementations, refer to the SYNCI instruction for additional information on resolving 
instruction hazards created by writing the instruction stream). The effects of this barrier are seen starting with the 
instruction fetch and decode of the instruction at the PC to which the JR.HB instruction jumps. An equivalent barrier 
is also implemented by the ERET instruction, but that instruction is only available if access to Coprocessor 0 is 
enabled, whereas JR.HB is legal in all operating modes. 

This instruction clears both execution and instruction hazards. Refer to the EHB instruction description for the 
method of clearing execution hazards alone. 

JR.HB uses bit 10 of the instruction (the upper bit of the hint field) to denote the hazard barrier operation. 

For processors that implement the MIPS 16 ASE, set the ISA Mode bit to the value in GPR rs bit 0. Bit 0 of the target 
address is always zero so that no Address Exceptions occur when bit 0 of the source register is one. 

Restrictions: 

The effective target address in GPR rs must be naturally-aligned. For processors that do not implement the MIPS 16 
ASE, if either of the two least-significant bits are not zero, an Address Error exception occurs when the branch target 
is subsequently fetched as an instruction. For processors that do implement the MIPS 16 ASE, if bit 0 is zero and bit 1 
is one, an Address Error exception occurs when the jump target is subsequently fetched as an instruction. 

After modifying an instruction stream mapping or writing to the instruction stream, execution of those instructions 
has UNPREDICTABLE behavior until the hazard has been cleared with JALR.HB, JR.HB, ERET, or DERET. Fur- 
ther, the operation is UNPREDICTABLE if the mapping of the current instruction stream is modified. 

JR.HB does not clear hazards created by any instruction that is executed in the delay slot of the JALR.HB. Only haz- 
ards created by instructions executed before the JR.HB are cleared by the JALR.HB. 

Processor operation is UNPREDICTABLE if a branch, jump, ERET, DERET, or WAIT instruction is placed in the 
delay slot of a branch or jump. 
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Jump Register with Hazard Barrier, cont. 


JR.HB 


Operation: 

I: temp <— GPR[rs] 

I+l:if Configl CA = 0 then 
PC <— t emp 
else 

PC <— temp GPRLEN _ 1 _ [ ] 0 
ISAMode <— tempg 
endif 

ClearHazards ( ) 

Exceptions: 

None 

Programming Notes: 

This instruction implements the final step in clearing execution and instruction hazards before execution continues. A 
hazard is created when a Coprocessor 0 or TLB write affects execution or the mapping of the instruction stream, or 
after a write to the instruction stream. When such a situation exists, software must explicitly indicate to hardware that 
the hazard should be cleared. Execution hazards alone can be cleared with the EHB instruction. Instruction hazards 
can only be cleared with a JR.HB, JALR.HB, or ERET instruction. These instructions cause hardware to clear the 
hazard before the instruction at the target of the jump is fetched. Note that because these instructions are encoded as 
jumps, the process of clearing an instruction hazard can often be included as part of a call (JALR) or return (JR) 
sequence, by simply replacing the original instructions with the HB equivalent. 

Example: Clearing hazards due to an ASID change 


/* 

* Routine called to modify ASID 

* mapping established. 

•k 

* aO = New ASID to establish 
*/ 


mf c0 

vO, 

C0_EntryHi 

/* 

li 

vl, 

~M_EntryHiASID 

/* 

and 

vO, 

vO , vl 

/* 

or 

vO, 

vO , aO 

/* 

mtcO 

vO , 

C0_EntryHi 

/* 

jr.hb 

ra 


/* 


nop 

Example: Making a write to the instruction 


and return with the new 


Read current ASID */ 

Get negative mask for field */ 
Clear out current ASID value */ 
OR in new ASID value */ 

Rewrite EntryHi with new ASID */ 
Return, clearing the hazard */ 

n visible 


/* 

* Routine called after new instructions are written to 

* make them visible and return with the hazards cleared. 

*/ 

{Synchronize the caches - see the SYNCI and CACHE instructions} 
sync /* Force memory synchronization */ 

jr.hb ra /* Return, clearing the hazard */ 

nop 
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Jump Register with Hazard Barrier, cont. 


JR.HB 


Example: Clearing instruction hazards in-line 


la AT, lOf 
j r . hb AT 
nop 


/* Jump to next instruction, clearing */ 
/* hazards */ 
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Load Byte 


LB 


31 26 25 21 20 16 15 


0 


LB 

100000 


base 


rt 


offset 


6 


5 


5 


16 


Format: LB rt, offset (base) MIPS32 

Purpose: 

To load a byte from memory as a signed value 

Description: rt <— memory [base+of f set ] 

The contents of the 8-bit byte at the memory location specified by the effective address are fetched, sign-extended, 
and placed in GPR rt. The 16-bit signed offset is added to the contents of GPR base to form the effective address. 

Restrictions: 

None 

Operation: 

vAddr sign_extend (off set) + GPR [base] 

(pAddr, CCA ) <— AddressTranslation (vAddr, DATA, LOAD) 

p 

pAddr <— pAddr PSIZE _ 1 _ 2 1 I (pAddr 1--0 xor ReverseEndian ) 
memwordf- LoadMemory (CCA, BYTE, pAddr, vAddr, DATA) 
byte <— vAddr li>0 xor BigEndianCPU 2 
GPR [rt ] 4- sign_extend (memword 7+8 * byte _ . 8*byte) 

Exceptions: 

TLB Refill, TLB Invalid, Address Error, Watch 
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Load Byte Unsigned 


LBU 


31 26 25 21 20 16 15 


0 


LBU 

100100 


base 


rt 


offset 


6 


5 


5 


16 


Format: LBU rt, offset (base) MIPS32 

Purpose: 

To load a byte from memory as an unsigned value 

Description: rt <— memory [base+of f set ] 

The contents of the 8-bit byte at the memory location specified by the effective address are fetched, zero-extended, 
and placed in GPR rt. The 16-bit signed offset is added to the contents of GPR base to form the effective address. 

Restrictions: 

None 

Operation: 

vAddr sign_extend (off set) + GPR [base] 

(pAddr, CCA ) <— AddressTranslation (vAddr, DATA, LOAD) 

p 

pAddr <— pAddr PSIZE _ 1 _ 2 1 I (pAddr 1--0 xor ReverseEndian ) 
memwordf- LoadMemory (CCA, BYTE, pAddr, vAddr, DATA) 
byte <— vAddr li>0 xor BigEndianCPU 2 
GPR [rt ] <— zero_extend (memword 7+8 * byte _ . 8*byte) 

Exceptions: 

TLB Refill, TLB Invalid, Address Error, Watch 
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LDC1 


Load Doubleword to Floating Point 


31 

26 

25 

21 

20 

16 

15 

0 

LDC1 

110101 

base 

ft 

offset 

6 



5 


5 

16 


Format: 

LDC1 

ft, 

offset (base) 



MIPS32 

Purpose: 









To load a doubleword from memory to an FPR 

Description: ft <— memory [base+off set] 

The contents of the 64-bit doubleword at the memory location specified by the aligned effective address are fetched 
and placed in FPR ft. The 16-bit signed offset is added to the contents of GPR base to form the effective address. 

Restrictions: 

An Address Error exception occurs if EffectiveAddresso () A 0 (not doubleword-aligned). 

Operation: 

vAddr <r- sign_extend (of f set ) + GPR[base] 

if vAddr 2 _ 0 A 0 3 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) <— AddressTranslation (vAddr, DATA, LOAD) 
paddr paddr xor ( (BigEndianCPU xor ReverseEndian) || 0 2 ) 
memlsw <— LoadMemory (CCA, WORD, pAddr, vAddr, DATA) 
paddr paddr xor 2#100 

memmsw <— LoadMemory (CCA, WORD, pAddr, vAddr+4, DATA) 

StoreFPR ( ft , UNINTERPRETED_WORD , memlsw) 

StoreFPR (ft+1, UNINTERPRETED_WORD , memmsw) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction, TLB Refill, TLB Invalid, Address Error, Watch 
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Load Doubleword to Coprocessor 2 


LDC2 


31 

26 

25 

21 

20 

16 

15 

0 

LDC2 

110110 

base 

rt 

offset 

6 



5 


5 

16 


Format: 

LDC2 

rt, 

offset (base) 



MIPS32 

Purpose: 









To load a doubleword from memory to a Coprocessor 2 register 

Description: rt 4— memory [base+off set] 

The contents of the 64-bit doubleword at the memory location specified by the aligned effective address are fetched 
and placed in Coprocessor 2 register rt. The 16-bit signed offset is added to the contents of GPR base to form the 
effective address. 

Restrictions: 

An Address Error exception occurs if EffectiveAddresso () A 0 (not doubleword-aligned). 

Operation: 

vAddr 4— sign_extend (off set) + GPR [base] 

o 

if vAddr 2 _ 0 A 0 then SignalException (AddressError ) endif 
(pAddr, CCA) 4 — AddressTranslation (vAddr, DATA, LOAD) 
paddr 4 — paddr xor ( (BigEndianCPU xor ReverseEndian) || 0 2 ) 
memlsw 4 — LoadMemory (CCA, WORD, pAddr, vAddr, DATA) 
paddr 4 — paddr xor 2#100 

memmsw 4 — LoadMemory (CCA, WORD, pAddr, vAddr+4, DATA) 

memlsw 

memmsw 

Exceptions: 

Coprocessor Unusable, Reserved Instruction, TLB Refill, TLB Invalid, Address Error, Watch 
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Load Doubleword Indexed to Floating Point 


LDXC1 


31 26 25 21 20 16 15 11 10 6 5 0 


COP IX 



0 


LDXC1 


base 

index 


fd 


010011 



00000 


000001 


6 5 

5 

5 

5 

6 

Format: LDXC1 fd, index (base) 




MIPS64 





MIPS32 Release 2 

Purpose: 






To load a doubleword from memory to an FPR (GPR+GPR addressing) 

Description: fd <— memory [base+index] 

The contents of the 64-bit doubleword at the memory location specified by the aligned effective address are fetched 
and placed in FPR fd. The contents of GPR index and GPR base are added to form the effective address. 

Restrictions: 

An Address Error exception occurs if EffectiveAddresso () A 0 (not doubleword-aligned). 


Operation: 

vAddr <r- GPR[base] + GPR[index] 
if vAddr 2 q A 0 3 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) <— AddressTranslation (vAddr, DATA, LOAD) 
paddr <— paddr xor ( (BigEndianCPU xor ReverseEndian) || 0 2 ) 
memlsw <— LoadMemory (CCA, WORD, pAddr, vAddr, DATA) 
paddr <— paddr xor 2#100 

memmsw <— LoadMemory (CCA, WORD, pAddr, vAddr+4, DATA) 

StoreFPR ( f t , UNINTERPRETED_WORD , memlsw) 

StoreFPR (ft+1, UNINTERPRETED_WORD , memmsw) 

Exceptions: 

TLB Refill, TLB Invalid, Address Error, Reserved Instruction, Coprocessor Unusable, Watch 
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Load Halfword 


LH 


31 26 25 21 20 16 15 


0 


LH 

100001 


base 


rt 


offset 


6 


5 


5 


16 


Format: LH rt, offset (base) MIPS32 

Purpose: 

To load a halfword from memory as a signed value 

Description: rt 4— memory [basetof f set ] 

The contents of the 16-bit halfword at the memory location specified by the aligned effective address are fetched, 
sign-extended, and placed in GPR rt. The 16-bit signed offset is added to the contents of GPR base to form the effec- 
tive address. 

Restrictions: 

The effective address must be naturally-aligned. If the least-significant bit of the address is non-zero, an Address 
Error exception occurs. 

Operation: 

vAddr 4— sign_extend (off set) + GPR [base] 
if vAddr 0 -t- 0 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) 4— AddressTranslation (vAddr, DATA, LOAD) 
pAddr 4— pAddr PSIZE _ 1 .. 2 I I (pAddr li 0 xor (ReverseEndian | 0)) 

memword 4— LoadMemory (CCA, HALFWORD, pAddr, vAddr, DATA) 
byte 4— vAddr-L __g xor (BigEndianCPU | 0) 

GPR[rt] 4— sign_extend (memwordxg+gi^ytg^ 8 *byte) 

Exceptions: 

TLB Refill, TLB Invalid, Bus Error, Address Error, Watch 
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Load Halfword Unsigned 


LHU 


31 26 25 21 20 16 15 


0 


LHU 

100101 


base 


rt 


offset 


6 


5 


5 


16 


Format: LHU rt, offset (base) MIPS32 

Purpose: 

To load a halfword from memory as an unsigned value 

Description: rt 4— memory [base+of f set ] 

The contents of the 16-bit halfword at the memory location specified by the aligned effective address are fetched, 
zero-extended, and placed in GPR rt. The 16-bit signed offset is added to the contents of GPR base to form the effec- 
tive address. 

Restrictions: 

The effective address must be naturally-aligned. If the least-significant bit of the address is non-zero, an Address 
Error exception occurs. 

Operation: 

vAddr 4— sign_extend (off set) + GPR [base] 
if vAddr 0 -t- 0 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) 4— AddressTranslation (vAddr, DATA, LOAD) 
pAddr 4— pAddr PSIZE _ 1 .. 2 I I (pAddr li 0 xor (ReverseEndian | 0)) 

memword 4— LoadMemory (CCA, HALFWORD, pAddr, vAddr, DATA) 
byte 4— vAddr-L . _ 0 xor (BigEndianCPU | 0) 

GPR[rt] 4— zero_extend (memwordxg+gi^ytg^ 8 *byte) 

Exceptions: 

TLB Refill, TLB Invalid, Address Error, Watch 
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Load Linked Word 


LL 


31 26 25 21 20 16 15 


0 


LL 

110000 


base 


rt 


offset 


6 


5 


5 


16 


Format: LL rt, offset (base) MIPS32 

Purpose: 

To load a word from memory for an atomic read-modify-write 

Description: rt 4— memory [basetoff set] 

The LL and SC instructions provide the primitives to implement atomic read-modify-write (RMW) operations for 
synchronizable memory locations. 

The contents of the 32-bit word at the memory location specified by the aligned effective address are fetched and 
written into GPR rt. The 16-bit signed offset is added to the contents of GPR base to form an effective address. 

This begins a RMW sequence on the current processor. There can be only one active RMW sequence per processor. 
When an LL is executed it starts an active RMW sequence replacing any other sequence that was active. The RMW 
sequence is completed by a subsequent SC instruction that either completes the RMW sequence atomically and suc- 
ceeds, or does not and fails. 

Executing LL on one processor does not cause an action that, by itself, causes an SC for the same block to fail on 
another processor. 

An execution of LL does not have to be followed by execution of SC; a program is free to abandon the RMW 
sequence without attempting a write. 

Restrictions: 

The addressed location must be synchronizable by all processors and I/O devices sharing the location; if it is not, the 
result in UNPREDICTABLE. Which storage is synchronizable is a function of both CPU and system implementa- 
tions. See the documentation of the SC instruction for the formal definition. 

The effective address must be naturally-aligned. If either of the 2 least-significant bits of the effective address is 
non-zero, an Address Error exception occurs. 

Operation: 

vAddr 4— sign_extend (off set) + GPR [base] 
if vAddr-L q ■£■ 0 2 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) 4— AddressTranslation (vAddr, DATA, LOAD) 
memword 4— LoadMemory (CCA, WORD, pAddr, vAddr, DATA) 

GPR[rt] 4— memword 
LLbit 4- 1 
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Load Linked Word (cont.) 


LL 


Exceptions: 

TLB Refill, TLB Invalid, Address Error, Reserved Instruction, Watch 

Programming Notes: 

There is no Load Linked Word Unsigned operation corresponding to Load Word Unsigned. 
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Load Upper Immediate 


LUI 


31 26 25 21 20 16 15 0 


LUI 

0 





rt 

immediate 

001111 

00000 




6 5 5 16 


Format: LUI rt, immediate MIPS32 

Purpose: 

To load a constant into the upper half of a word 

Description: rt <— immediate | I 0 16 

The 16-bit immediate is shifted left 16 bits and concatenated with 16 bits of low-order zeros. The 32-bit result is 
placed into GPR rt. 

Restrictions: 

None 

Operation: 

GPR[rt] t— immediate I | 0 16 

Exceptions: 

None 
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Load Doubleword Indexed Unaligned to Floating Point 


LUXC1 


31 26 25 21 20 16 15 11 10 6 5 0 


COP IX 



0 


LUXC1 


base 

index 


fd 


010011 



00000 


000101 


6 5 

5 

5 

5 

6 

Format: LUXC1 fd, index (base) 




MIPS64 





MIPS32 Release 2 

Purpose: 






To load a doubleword from memory to an FPR (GPR+GPR addressing), ignoring alignment 

Description: fd memory [ (base + index) P s IZE - 1 .. 3 ] 

The contents of the 64-bit doubleword at the memory location specified by the effective address are fetched and 
placed into the low word of coprocessor 1 general register fd. The contents of GPR index and GPR base are added to 
form the effective address. The effective address is doubleword-aligned; EffectiveAddress 9 0 are ignored. 

Restrictions: 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 


Operation: 

vAddr (GPR [base ] +GPR [ index] ) 63 __ 3 ] | 0 3 

(pAddr, CCA) <— AddressTranslation (vAddr, DATA, LOAD) 
paddr <— paddr xor ( (BigEndianCPU xor ReverseEndian) || 0 2 ) 
memlsw <— LoadMemory (CCA, WORD, pAddr, vAddr, DATA) 
paddr <— paddr xor 2#100 

memmsw <— LoadMemory (CCA, WORD, pAddr, vAddr+4, DATA) 
StoreFPR ( ft , UNINTERPRETED_WORD , memlsw) 

StoreFPR (ft+1, UNINTERPRETED_WORD , memmsw) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction, TLB Refill, TLB Invalid, Watch 
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Load Word 


LW 


31 26 25 21 20 16 15 


0 


LW 

100011 


base 


rt 


offset 


6 


5 


5 


16 


Format: LW rt, offset (base) MIPS32 

Purpose: 

To load a word from memory as a signed value 

Description: rt 4— memory [basetoff set] 

The contents of the 32-bit word at the memory location specified by the aligned effective address are fetched, 
sign-extended to the GPR register length if necessary, and placed in GPR rt. The 16-bit signed offset is added to the 
contents of GPR base to form the effective address. 

Restrictions: 

The effective address must be naturally-aligned. If either of the 2 least-significant bits of the address is non-zero, an 
Address Error exception occurs. 

Operation: 

vAddr 4— sign_extend (off set) + GPR [base] 
if vAddr lj>0 ^ 0 2 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) 4— AddressTranslation (vAddr, DATA, LOAD) 
memwordf- LoadMemory (CCA, WORD, pAddr, vAddr, DATA) 

GPR[rt]4— memword 

Exceptions: 

TLB Refill, TLB Invalid, Bus Error, Address Error, Watch 
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LWC1 


Load Word to Floating Point 


31 26 25 21 20 16 15 


0 


LWC1 

110001 


base 


rt 


offset 


6 


5 


5 


16 


Format: LWC1 ft, offset (base) MIPS32 

Purpose: 

To load a word from memory to an FPR 

Description: ft 4— memory [base+off set] 

The contents of the 32-bit word at the memory location specified by the aligned effective address are fetched and 
placed into the low word of coprocessor 1 general register/?. The 16-bit signed offset is added to the contents of 
GPR base to form the effective address. 

Restrictions: 

An Address Error exception occurs if EficctivcAddrcss | () A 0 (not word-aligned). 

Operation: 

vAddr 4— sign_extend (off set) + GPR [base] 
if vAddr-L ^ 0 2 then 

SignalException (AddressError) 
endif 

(pAddr, CCA) 4— AddressTranslation (vAddr, DATA, LOAD) 

memword 4— LoadMemory (CCA, WORD, pAddr, vAddr, DATA) 

StoreFPR ( ft , UNINTERPRETED_WORD, 
memword) 


Exceptions: 

TLB Refill, TLB Invalid, Address Error, Reserved Instruction, Coprocessor Unusable, Watch 
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LWC2 


Load Word to Coprocessor 2 


31 

26 
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21 
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15 

0 
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rt 

offset 

6 
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5 

16 


Format: 

LWC2 

rt, 

offset (base) 



MIPS32 

Purpose: 









To load a word from memory to a COP2 register 

Description: rt 4— memory [base+off set] 

The contents of the 32-bit word at the memory location specified by the aligned effective address are fetched and 
placed into the low word of COP2 (Coprocessor 2) general register rt. The 16-bit signed offset is added to the con- 
tents of GPR base to form the effective address. 

Restrictions: 

An Address Error exception occurs if EtfcctiveAddrcss | () A 0 (not word-aligned). 

Operation: 

vAddr 4— sign_extend (off set) + GPR [base] 
if vAddr 12-- o ^ 0 2 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) <— AddressTranslation (vAddr, DATA, LOAD) 
memword 4— LoadMemory (CCA, DOUBLEWORD, pAddr, vAddr, DATA) 

CPR[2,rt,0] 4— memword 

Exceptions: 

TLB Refill, TLB Invalid, Address Error, Reserved Instruction, Coprocessor Unusable, Watch 
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Load Word Left 


LWL 



Format: LWL rt, offset (base) MIPS32 

Purpose: 

To load the most-significant part of a word as a signed value from an unaligned memory address 

Description: rt 4— rt MERGE memory [base+of fset] 

The 16-bit signed offset is added to the contents of GPR base to form an effective address (EffAddr). EjfAddr is the 
address of the most-significant of 4 consecutive bytes forming a word (W) in memory starting at an arbitrary byte 
boundary. 

The most-significant 1 to 4 bytes of W is in the aligned word containing the EjfAddr. This part of W is loaded into the 
most-significant (left) part of the word in GPR rt. The remaining least-significant part of the word in GPR rt is 
unchanged. 

The figure below illustrates this operation using big-endian byte ordering for 32-bit and 64-bit registers. The 4 con- 
secutive bytes in 2.. 5 form an unaligned word starting at location 2. A part of W, 2 bytes, is in the aligned word con- 
taining the most-significant byte at 2. First, LWL loads these 2 bytes into the left part of the destination register word 
and leaves the right part of the destination word unchanged. Next, the complementary LWR loads the remainder of 
the unaligned word 

Figure 3-5 Unaligned Word Load Using LWL and LWR 

Word at byte 2 in big-endian memory; each memory byte contains its own address 
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LWL 


Load Word Left (con't) 


The bytes loaded from memory to the destination register depend on both the offset of the effective address within an 
aligned word, that is, the low 2 bits of the address (vAddi j 0 ), and the current byte-ordering mode of the processor 
(big- or little-endian). The figure below shows the bytes loaded for every combination of offset and byte ordering. 

Figure 3-6 Bytes Loaded by LWL Instruction 
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LWL 


Load Word Left (con't) 


Restrictions: 

None 

Operation: 

vAddr <r- sign_extend (off set) + GPR[base] 

(pAddr, CCA ) <— AddressTranslation (vAddr, DATA, LOAD) 
pAddr pAddr PSIZE _ 1 _ 2 I I (pAddr 1-0 xor ReverseEndian 2 ) 
if BigEndianMem = 0 then 

pAddrt— pAddr PSIZE _ 1 £ I I 0 2 
endif 

byte <— vAddr p xor BigEndianCPU 2 

memwordf- LoadMemory (CCA, byte, pAddr, vAddr, DATA) 
temp <- memword 7+8 * byte . .o [] GPR [ rt ] 23 -8*byte . . 0 
GPR[rt]<— temp 


Exceptions: 

None 

TLB Refill, TLB Invalid, Bus Error, Address Error, Watch 

Programming Notes: 

The architecture provides no direct support for treating unaligned words as unsigned values, that is, zeroing bits 
63.32 of the destination register when bit 31 is loaded. 

Historical Information 

In the MIPS I architecture, the LWL and LWR instructions were exceptions to the load-delay scheduling restriction. 
A LWL or LWR instruction which was immediately followed by another LWL or LWR instruction, and used the 
same destination register would correctly merge the 1 to 4 loaded bytes with the data loaded by the previous instruc- 
tion. All such restrictions were removed from the architecture in MIPS II. 
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LWR 


Load Word Right 


31 26 25 21 20 16 15 
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offset 
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Format: LWR rt, offset (base) MIPS32 

Purpose: 

To load the least-significant part of a word from an unaligned memory address as a signed value 

Description: rt 4— rt MERGE memory [base+of fset] 

The 16-bit signed offset is added to the contents of GPR base to form an effective address (EffAddr). EffAddr is the 
address of the least-significant of 4 consecutive bytes forming a word (W) in memory starting at an arbitrary byte 
boundary. 

A part of W. the least-significant 1 to 4 bytes, is in the aligned word containing EffAddr. This part of W is loaded into 
the least-significant (right ) part of the word in GPR rt. The remaining most-significant part of the word in GPR rt is 
unchanged. 

Executing both LWR and LWL, in either order, delivers a sign-extended word value in the destination register. 

The figure below illustrates this operation using big-endian byte ordering for 32-bit and 64-bit registers. The 4 con- 
secutive bytes in 2.. 5 form an unaligned word starting at location 2. A part of W, 2 bytes, is in the aligned word con- 
taining the least-significant byte at 5. First, LWR loads these 2 bytes into the right part of the destination register. 
Next, the complementary LWL loads the remainder of the unaligned word. 
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LWR 


Load Word Right (cont.) 

Figure 3-7 Unaligned Word Load Using LWL and LWR 

Word at byte 2 in big-endian memory; each memory byte contains its own address 



The bytes loaded from memory to the destination register depend on both the offset of the effective address within an 
aligned word, that is, the low 2 bits of the address (vAddij q), and the current byte-ordering mode of the processor 
(big- or little-endian). The figure below shows the bytes loaded for every combination of offset and byte ordering. 
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LWR 


Load Word Right (cont.) 

Figure 3-8 Bytes Loaded by LWL Instruction 
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LWR 


Load Word Right (cont.) 


Restrictions: 

None 

Operation: 

vAddr <r- sign_extend (off set) + GPR[base] 

(pAddr, CCA ) <— AddressTranslation (vAddr, DATA, LOAD) 
pAddr pAddr PSIZE _ 1 _ 2 I I (pAddr 1-0 xor ReverseEndian 2 ) 
if BigEndianMem = 0 then 

pAddrt— pAddr PSIZE _ 1 £ I I 0 2 
endif 

byte <— vAddr p xor BigEndianCPU 2 

memwordf- LoadMemory (CCA, byte, pAddr, vAddr, DATA) 
temp <- memword 31- . 32 -8*byte II GPR [ rt ] 31 - 8 *b Yte . . 0 
GPR[rt]<— temp 

Exceptions: 

TLB Refill, TLB Invalid, Bus Error, Address Error, Watch 

Programming Notes: 

The architecture provides no direct support for treating unaligned words as unsigned values, that is, zeroing bits 
63.32 of the destination register when bit 31 is loaded. 

Historical Information 

In the MIPS I architecture, the LWL and LWR instructions were exceptions to the load-delay scheduling restriction. 
A LWL or LWR instruction which was immediately followed by another LWL or LWR instruction, and used the 
same destination register would correctly merge the 1 to 4 loaded bytes with the data loaded by the previous instruc- 
tion. All such restrictions were removed from the architecture in MIPS II. 
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Load Word Indexed to Floating Point 


LWXC1 


31 26 25 21 20 16 15 11 10 6 5 0 


COP IX 
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6 

Format: 

LWXC1 fd, index (base) 




MIPS64 
MIPS32 Release 2 


Purpose: 

To load a word from memory to an FPR (GPR+GPR addressing) 

Description: fd <— memory [base+index] 

The contents of the 32-bit word at the memory location specified by the aligned effective address are fetched and 
placed into the low word of coprocessor 1 general register fd. The contents of GPR index and GPR base are added to 
form the effective address. 

Restrictions: 

An Address Error exception occurs if EffectiveAddress | () A 0 (not word-aligned). 

Operation: 

vAddr <— GPR [base] + GPR [index] 
if vAddr-^o ^ 0 2 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) <— AddressTranslation (vAddr, DATA, LOAD) 

memword <— LoadMemory (CCA, WORD, pAddr, vAddr, DATA) 

StoreFPR ( ft , UNINTERPRETED_WORD, 
memword) 


Exceptions: 

TLB Refill, TLB Invalid, Address Error, Reserved Instruction, Coprocessor Unusable, Watch 
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MADD 


Multiply and Add Word to Hi,Lo 
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Format: madd rs, rt 




MIPS32 

Purpose: 






To multiply two words and add the result to Hi, Lo 

Description: (hi,lo) «- (hi,lo) + (rs x rt) 

The 32-bit word value in GPR rs is multiplied by the 32-bit word value in GPR rt, treating both operands as signed 
values, to produce a 64-bit result. The product is added to the 64-bit concatenated values of HI and LO.. The most sig- 
nificant 32 bits of the result are written into HI and the least signficant 32 bits are written into LO. No arithmetic 
exception occurs under any circumstances. 

Restrictions: 

None 

This instruction does not provide the capability of writing directly to a target GPR. 

Operation: 

temp <- (HI | | LO) + (GPR [ rs ] X GPR[rt]) 

HI <- temp 63 _ _ 32 
LO <— temp 31 __ 0 

Exceptions: 

None 

Programming Notes: 

Where the size of the operands are known, software should place the shorter operand in GPR rt. This may reduce the 
latency of the instruction on those processors which implement data-dependent instruction latencies. 
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Floating Point Multiply Add 


MADD.fmt 
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Format: madd.s fd, fr, 

fs, 

ft 

MIPS64 
MIPS32 Release 2 

MADD . D fd, fr, 

fs, 

ft 

MIPS64 
MIPS32 Release 2 

MADD.PS fd, fr, 

■ f s, 

ft 

MIPS64 
MIPS32 Release 2 


Purpose: 

To perform a combined multiply-then-add of FP values 
Description: fd <- (fs x ft) + fr 

The value in FPR fs is multiplied by the value in FPR ft to produce an intermediate product. The value in FPR fr is 
added to the product. The result sum is calculated to infinite precision, rounded according to the current rounding 
mode in FCSR , and placed into FPR fd. The operands and result are values in format 

MADD.PS multiplies then adds the upper and lower halves of FPR fr, FPR fs, and FPR ft independently, and ORs 
together any generated exceptional conditions. 

Cause bits are ORed into the Flag bits if no exception is taken. 

Restrictions: 

The fields fr,fs,ft, and fd must specify FPRs valid for operands of type/wf; if they are not valid, the result is UNPRE- 
DICTABLE. 

The operands must be values in format fmt', if they are not, the result is UNPREDICTABLE and the value of the 
operand FPRs becomes UNPREDICTABLE. 

The result of MADD.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 


Operation: 

vfr ValueFPR(fr, fmt) 
vfs <r- ValueFPR(fs, fmt) 
vft ValueFPR(ft, fmt) 

StoreFPR(fd, fmt, (vfs X fmt vft) + fmt vfr) 
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Floating Point Multiply Add (cont.) 


MADD.fmt 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Inexact, Unimplemented Operation, Invalid Operation, Overflow, Underflow 


174 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




MADDU 


Multiply and Add Unsigned Word to Hi,Lo 
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SPECIAL2 



0 

0 

MADDU 


rs 

rt 




011100 



00000 

00000 

000001 


6 5 

5 

5 

5 

6 

Format: maddu rs, rt 




MIPS32 

Purpose: 






To multiply two unsigned words and add the result to Hi, Lo. 

Description: (hi,lo) <- (hi,lo) + (rs x rt) 

The 32-bit word value in GPR rs is multiplied by the 32-bit word value in GPR rt, treating both operands as unsigned 
values, to produce a 64-bit result. The product is added to the 64-bit concatenated values of HI and LO.. The most sig- 
nificant 32 bits of the result are written into HI and the least signficant 32 bits are written into LO. No arithmetic 
exception occurs under any circumstances. 

Restrictions: 

None 

This instruction does not provide the capability of writing directly to a target GPR. 

Operation: 

temp <- (HI | | LO) + (GPR [ rs ] X GPR[rt]) 

HI <- temp 63 _ _ 32 
LO <— temp 31 __ 0 

Exceptions: 

None 

Programming Notes: 

Where the size of the operands are known, software should place the shorter operand in GPR rt. This may reduce the 
latency of the instruction on those processors which implement data-dependent instruction latencies. 
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MFCO 


Move from Coprocessor 0 


31 26 25 21 20 16 15 11 10 3 2 0 


COPO 

MF 



0 




rt 

rd 


sel 

010000 

00000 



00000000 



6 

5 

5 

5 

8 

3 

Format: MFCO rt, 

rd 




MIPS32 

MFCO rt, 

rd, sel 




MIPS32 

Purpose: 







To move the contents of a coprocessor 0 register to a general register. 

Description: rt 4- cpr [ 0 , rd, sel ] 

The contents of the coprocessor 0 register specified by the combination of rd and sel are loaded into general register 
rt. Note that not all coprocessor 0 registers support the sel field. In those instances, the sel field must be zero. 

Restrictions: 

The results are UNDEFINED if coprocessor 0 does not contain a register as specified by rd and sel. 

Operation: 

data <— CPR [ 0 , rd, sel ] 

GPR[rt] data 

Exceptions: 

Coprocessor Unusable 
Reserved Instruction 
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MFC1 


Move Word From Floating Point 
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Format: MFC1 rt, fs 




MIPS32 

Purpose: 






To copy a word from an FPU (CPI) general register to a GPR 

Description: rt <- fs 

The contents of FPR fs are loaded into general register rt. 

Restrictions: 

Operation: 

data <- ValueFPR (fs, UNINTERPRETED_WORD ) 

GPR [ rt ] 4— data 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Historical Information: 

For MIPS I, MIPS II, and MIPS III the contents of GPR rt are UNPREDICTABLE for the instruction immediately 
following MFC1. 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


177 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




Move Word From Coprocessor 2 
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Format: 

MFC2 rt, 
MFC2, rt, 

rd 

■ rd, 

sel 





MIPS32 

MIPS32 


The syntax shown above is an example using MFC1 as a model. The specific syntax is implementation dependent. 

Purpose: 

To copy a word from a COP2 general register to a GPR 

Description: rt <- CP2CPR[impl] 

The contents of the coprocessor 2 register denoted by the Impl field are and placed into general register rt. The inter- 
pretation of the Impl field is left entirely to the Coprocessor 2 implementation and is not specified by the architecture. 

Restrictions: 

The results are UNPREDICTABLE if Impl specifies a coprocessor 2 register that does not exist. 

Operation: 

data <— CP2CPR [ Impl ] 

GPR[rt] <— data 

Exceptions: 

Coprocessor Unusable 
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MFHC1 


Move Word From High Half of Floating Point Register 
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Format: mfhci rt, fs 



MIPS32 Release 2 

Purpose: 





To copy a word from the high half of an FPU (CPI) general register to a GPR 

Description: rt <- fs 63 32 

The contents of the high word of FPR fs are loaded into general register rt. This instruction is primarily intended to 
support 64-bit floating point units on a 32-bit CPU, but the semantics of the instruction are defined for all cases. 

Restrictions: 

In implementations prior to Release 2 of the architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

The results are UNPREDICTABLE if Status FR = 0 antl/.'v is odd. 

Operation: 

data <- ValueFPR (fs, UNINTERPRETED_DOUBLEWORD ) 63- -32 
GPR[rt] <— data 

Exceptions: 

Coprocessor Unusable 
Reserved Instruction 
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Move Word From High Half of Coprocessor 2 Register 


MFHC2 


31 

26 25 

21 20 


16 15 

11 10 

3 2 0 

COP2 

010010 

MFH 

00011 


rt 


Impl 


6 

5 


5 


16 


Format: 

MFHC 2 rt, rd 
MFHC 2 , rt, rd, 

sel 




MIPS32 Release 2 
MIPS32 Release 2 


The syntax shown above is an example using MFHC1 as a model. The specific syntax is implementation dependent. 

Purpose: 

To copy a word from the high half of a COP2 general register to a GPR 

Description: rt <- CP2CPR[impl] 63 _ _ 32 

The contents of the high word of the coprocessor 2 register denoted by the Impl field are placed into GPR rt. The 
interpretation of the Impl field is left entirely to the Coprocessor 2 implementation and is not specified by the archi- 
tecture. 

Restrictions: 

The results are UNPREDICTABLE if Impl specifies a coprocessor 2 register that does not exist, or if that register is 
not 64 bits wide. 

In implementations prior to Release 2 of the architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

Operation: 

data <— CP 2 CPR [ Impl ] 63 _ _ 32 
GPR [ rt ] 4— data 

Exceptions: 

Coprocessor Unusable 
Reserved Instruction 
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MFHI 


Move From HI Register 


31 26 25 16 15 11 10 6 5 0 


SPECIAL 

0 


0 

MFHI 



rd 



000000 

00 0000 0000 


00000 

010000 


6 

10 

5 

5 

6 

Format: mfhi rd 




MIPS32 


Purpose: 

To copy the special purpose HI register to a GPR 

Description: rd f- hi 

The contents of special register HI are loaded into GPR rd. 

Restrictions: 

None 

Operation: 

GPR [ rd] <- HI 

Exceptions: 

None 

Historical Information: 

In the MIPS I, II, and III architectures, the two instructions which follow the MFHI must not moodify the HI register. 
If this restriction is violated, the result of the MFHI is UNPREDICTABLE. This restriction was removed in MIPS 
IV and MIPS32, and all subsequent levels of the architecture. 
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Move From LO Register 


MFLO 


31 26 25 16 15 11 10 6 5 0 


SPECIAL 

0 


0 

MFLO 



rd 



000000 

00 0000 0000 


00000 

010010 


6 


10 

5 

5 

6 

Format: mflo 

rd 




MIPS32 


Purpose: 

To copy the special purpose LO register to a GPR 

Description: rd 4— lo 

The contents of special register LO are loaded into GPR rd. 

Restrictions: None 

Operation: 

GPR [ rd] 4- LO 

Exceptions: 

None 

Historical Information: 

In the MIPS I, II, and III architectures, the two instructions which follow the MFHI must not moodify the HI register. 
If this restriction is violated, the result of the MFHI is UNPREDICTABLE. This restriction was removed in MIPS 
IV and MIPS32, and all subsequent levels of the architecture. 
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Floating Point Move 


MOV.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 


0 



MOV 


fmt 


fs 

fd 


010001 


00000 



000110 


6 

5 

5 

5 

5 

6 

Format: 

MOV . S f d , f s 




MIPS32 


MOV . D f d , f s 




MIPS32 


MOV.PS fd, fs 




MIPS64 






MIPS32 Release 2 


Purpose: 

To move an FP value between FPRs 

Description: fd <- fs 

The value in FPR/s is placed into FPR fd. The source and destination are values in format In paired-single for- 
mat, both the halves of the pair are copied to fd. 

The move is non-arithmetic; it causes no IEEE 754 exceptions. 

Restrictions: 

The fields fs and fd must specify FPRs valid for operands of type fmt; if they are not valid, the result is UNPRE- 
DICTABLE. 

The operand must be a value in format fmt; if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of MOV.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 


Operation: 

StoreFPR(fd, fmt, ValueFPR(fs, fmt)) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation 
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MOVF 


Move Conditional on Floating Point False 


31 26 25 21 20 18 17 16 15 11 10 6 5 0 


SPECIAL 



0 

tf 


0 

MOVCI 


rs 

cc 



rd 



000000 



0 

0 


00000 

000001 


6 5 

3 

1 1 

5 

5 

6 

Format: movf rd, rs, cc 





MIPS32 

Purpose: 







To test an FP condition code then conditionally move a GPR 

Description: if cc = 0 then rd <— rs 

If the floating point condition code specified by CC is zero, then the contents of GPR rs are placed into GPR rd. 

Restrictions: 

Operation: 

if FPConditionCode (cc) = 0 then 
GPR [ rd] <- GPR [ rs ] 
endif 


Exceptions: 

Reserved Instruction, Coprocessor Unusable 
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Floating Point Move Conditional on Floating Point False 


MOVF.fmt 


31 

26 25 

21 20 

18 17 

16 15 


11 10 

6 5 0 

COP1 

fmt 

CC 

0 

tf 

fs 

fd 

MOVCF 

010001 



0 

0 



010001 

6 

5 

3 

l 

1 

5 

5 

6 

Format: 

MOVF.S fd, fs, 

cc 





MIPS32 


MOVF.D fd, fs, 

cc 





MIPS32 


MOVF.PS fd, fs, 

. cc 





MIPS64 








MIPS32 Release 2 


Purpose: 

To test an FP condition code then conditionally move an FP value 

Description: if cc = 0 then fd <— fs 

If the floating point condition code specified by CC is zero, then the value in FPR /.v is placed into FPR fd. The source 
and destination are values in format fmt. 

If the condition code is not zero, then FPR /s is not copied and FPR fd retains its previous value in format fmt. If fd did 
not contain a value either in format fmt or previously unused data from a load or move-to operation that could be 
interpreted in format then the value of fd becomes UNPREDICTABLE. 

MOVF.PS conditionally merges the lower half of FPR fs into the lower half of FPR fd if condition code CC is zero, 
and independently merges the upper half of FPR fs into the upper half of FPR /of if condition code CC+1 is zero. The 
CC field must be even; if it is odd, the result of this operation is UNPREDICTABLE. 

The move is non-arithmetic; it causes no IEEE 754 exceptions. 

Restrictions: 

The fields fs and fd must specify FPRs valid for operands of type fmt\ if they are not valid, the result is UNPRE- 
DICTABLE. The operand must be a value in format fmt, if it is not, the result is UNPREDITABLE and the value of 
the operand FPR becomes UNPREDICTABLE. 

| The result of MOVF.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 
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MOVF.fmt 


Floating Point Move Conditional on Floating Point False (cont.) 


Operation: 


if FPConditionCode (cc) = 0 then 

StoreFPR(fd, fmt, ValueFPR(fs, fmt) ) 
else 

StoreFPR(fd, fmt, ValueFPR(fd, fmt)) 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation 
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MOVN 


Move Conditional on Not Zero 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL 




0 

MOVN 


rs 

rt 

rd 



000000 




00000 

001011 


6 5 

5 

5 

5 

6 

Format: movn rd, rs, rt 




MIPS32 

Purpose: 






To conditionally move a GPR after testing a GPR value 

Description: if rt ^ 0 then rd <— rs 

If the value in GPR rt is not equal to zero, then the contents of GPR rs are placed into GPR rd. 

Restrictions: 

None 

Operation: 

if GPR[rt] A 0 then 
GPR [ rd] e- GPR [ rs ] 
endif 

Exceptions: 

None 

Programming Notes: 

The non-zero value tested here is the condition true result from the SLT, SLTI, SLTU, and SLTIU comparison instruc- 
tions. 
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Floating Point Move Conditional on Not Zero 


MOVN.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 





MOVN 


fmt 

rt 

fs 

fd 


010001 





010011 


6 5 5 5 5 6 


Format: movn.S fd, fs, rt 

MOVN.D fd, fs, rt 
MOVN.PS fd, fs, rt 


MIPS32 
MIPS32 
MIPS64 
MIPS32 Release 2 


Purpose: 

To test a GPR then conditionally move an FP value 

Description: if rt ^ 0 then fd <— fs 

If the value in GPR rt is not equal to zero, then the value in FPR fs is placed in FPR fd. The source and destination are 
values in format fmt. 

If GPR rt contains zero, then FPR fs is not copied and FPR fd contains its previous value in format fmt. If fd did not 
contain a value either in format fmt or previously unused data from a load or move-to operation that could be inter- 
preted in format fmt, then the value of fd becomes UNPREDICTABLE. 

The move is non-arithmetic; it causes no IEEE 754 exceptions. 

Restrictions: 

The fields fs and fd must specify FPRs valid for operands of type fmt\ if they are not valid, the result is UNPRE- 
DICTABLE. 

The operand must be a value in format fmt, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of MOVN.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 
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Floating Point Move Conditional on Not Zero 


MOVN.fmt 


Operation: 

if GPR[rt] =£ 0 then 

StoreFPR(fd, fmt, ValueFPR(fs, fmt) ) 
else 

StoreFPR(fd, fmt, ValueFPR(fd, fmt)) 
endif 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation 
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MOVT 


Move Conditional on Floating Point True 


31 26 25 21 20 18 17 16 15 11 10 6 5 0 


SPECIAL 



0 

tf 


0 

MOVCI 


rs 

cc 



rd 



000000 



0 

1 


00000 

000001 


6 5 

3 

1 1 

5 

5 

6 

Format: movt rd, rs, cc 





MIPS32 

Purpose: 







To test an FP condition code then conditionally move a GPR 

Description: if cc = 1 then rd 4— rs 

If the floating point condition code specified by CC is one, then the contents of GPR rs are placed into GPR rd. 

Restrictions: 

Operation: 

if FPConditionCode (cc) = 1 then 
GPR [ rd] 4- GPR [ rs ] 
endif 


Exceptions: 

Reserved Instruction, Coprocessor Unusable 
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Floating Point Move Conditional on Floating Point True 


MOVT.fmt 


31 

26 

25 

21 

20 18 

17 

16 

15 


11 

10 


6 

5 

0 

COP1 



fmt 

CC 

0 

tf 


fs 



fd 



MOVCF 

010001 





0 

1 








010001 

6 



5 

3 

1 

1 


5 



5 



6 

Format: 

MOVT 

,S 

fd, fs, cc 











MIPS32 


MOVT.D fd, fs, cc MIPS32 

MOVT.PS fd, fs, cc MIPS64 

| MIPS32 Release 2 

Purpose: 

To test an FP condition code then conditionally move an FP value 

Description: if cc = l then fd <— fs 

If the floating point condition code specified by CC is one, then the value in FPR fs is placed into FPR/d. The source 
and destination are values in format fmt. 

If the condition code is not one, then FPR/s is not copied and FPR /i/ contains its previous value in format fmt. If fd 
did not contain a value either in format fmt or previously unused data from a load or move-to operation that could be 
interpreted in format fmt, then the value of fd becomes undefined. 

MOVT.PS conditionally merges the lower half of FPR fs into the lower half of FPR fd if condition code CC is one, 
and independently merges the upper half of FPR fs into the upper half of FPR fd if condition code CC+1 is one. The 
CC field should be even; if it is odd, the result of this operation is UNPREDICTABLE. 

The move is non-arithmetic; it causes no IEEE 754 exceptions. 

Restrictions: 

The fields fs and fd must specify FPRs valid for operands of type frnt; if they are not valid, the result is UNPRE- 
DICTABLE. The operand must be a value in format fmt', if it is not, the result is UNPREDICTABLE and the value 
of the operand FPR becomes UNPREDICTABLE. 

| The result of MOVT.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 
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Floating Point Move Conditional on Floating Point True 


MOVT.fmt 


Operation: 

if FPConditionCode (cc) = 0 then 

StoreFPR(fd, fmt, ValueFPR(fs, fmt) ) 
else 

StoreFPR(fd, fmt, ValueFPR(fd, fmt)) 
endif 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation 
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MOVZ 


Move Conditional on Zero 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL 




0 

MOVZ 


rs 

rt 

rd 



000000 




00000 

001010 


6 5 

5 

5 

5 

6 

Format: MOVZ rd, rs, rt 




MIPS32 

Purpose: 






To conditionally move a GPR after testing a GPR value 

Description: if rt = 0 then rd 4- rs 

If the value in GPR rt is equal to zero, then the contents of GPR rs are placed into GPR rd. 

Restrictions: 

None 

Operation: 

if GPR[rt] = 0 then 
GPR [ rd] e- GPR [ rs ] 
endif 

Exceptions: 

None 

Programming Notes: 

The zero value tested here is the condition false result from the SLT, SLTI, SLTU, and SLTIU comparison instruc- 
tions. 
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Floating Point Move Conditional on Zero 


MOVZ.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 





MOVZ 


fmt 

rt 

fs 

fd 


010001 





010010 


6 5 5 5 5 6 


Format: movz.S fd, fs, rt 

MOVZ.D fd, fs, rt 
MOVZ.PS fd, fs, rt 


MIPS32 
MIPS32 
MIPS64 
MIPS32 Release 2 


Purpose: 

To test a GPR then conditionally move an FP value 

Description: if rt = 0 then fd <— fs 

If the value in GPR rt is equal to zero then the value in FPR fs is placed in FPR fd. The source and destination are val- 
ues in format fmt. 

If GPR rt is not zero, then FPR /i is not copied and FPR fd contains its previous value in format fmt. If fd did not con- 
tain a value either in format fmt or previously unused data from a load or move-to operation that could be interpreted 
in format fint. then the value of fd becomes UNPREDICTABLE. 

The move is non-arithmetic; it causes no IEEE 754 exceptions. 

Restrictions: 

The fields fs and fd must specify FPRs valid for operands of type fint; if they are not valid, the result is UNPRE- 
DICTABLE. 

The operand must be a value in format fmt; if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of MOVZ.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 
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Floating Point Move Conditional on Zero (cont.) 


MOVZ.fmt 


Operation: 

if GPR[rt] = 0 then 

StoreFPR(fd, fmt, ValueFPR(fs, fmt) ) 
else 

StoreFPR(fd, fmt, ValueFPR(fd, fmt)) 
endif 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation 
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MSUB 


Multiply and Subtract Word to Hi,Lo 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL2 



0 

0 

MSUB 


rs 

rt 




011100 



00000 

00000 

000100 


6 5 

5 

5 

5 

6 

Format: MSUB rs, rt 




MIPS32 

Purpose: 






To multiply two words and subtract the result from Hi, Lo 

Description: (hi,lo) <- (hi,lo) - (rs x rt) 

The 32-bit word value in GPR rs is multiplied by the 32-bit value in GPR rt , treating both operands as signed values, 
to produce a 64-bit result. The product is subtracted from the 64-bit concatenated values of HI and LO.. The most sig- 
nificant 32 bits of the result are written into HI and the least signficant 32 bits are written into LO. No arithmetic 
exception occurs under any circumstances. 

Restrictions: 

None 

This instruction does not provide the capability of writing directly to a target GPR. 

Operation: 

t emp <— (HI 
HI <— temp 63i 
LO <— temp 31 . 

Exceptions: 

None 

Programming Notes: 

Where the size of the operands are known, software should place the shorter operand in GPR rt. This may reduce the 
latency of the instruction on those processors which implement data-dependent instruction latencies. 


| | LO) — (GPR [ rs ] X GPR [ rt ] ) 
.32 

.0 
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Floating Point Multiply Subtract 


MSUB.fmt 


31 26 25 21 20 16 15 11 10 6 5 3 2 0 


COP IX 





MSUB 



fr 

ft 

fs 

fd 


fmt 

010011 





101 



6 

5 

5 

5 

5 

3 

3 

Format: 

MSUB . S fd, fr, fs, 

ft 




MIPS64 


MSUB.D fd, fr, fs, 

ft 




MIPS64 


MSUB.PS fd, fr, fs, 

ft 




MIPS64 






MIPS32 Release 2 


Purpose: 

To perform a combined multiply-then-subtract of FP values 

Description: fd f- (fs x ft) - fr 

The value in FPR fs is multiplied by the value in FPR ft to produce an intermediate product. The value in FPR fr is 
subtracted from the product. The subtraction result is calculated to infinite precision, rounded according to the current 
rounding mode in FCSR , and placed into FPR fd. The operands and result are values in format fmt. 

MSUB.PS multiplies then subtracts the upper and lower halves of FPR fr, FPR /.v, and FPR ft independently, and ORs 
together any generated exceptional conditions. 

Cause bits are ORed into the Flag bits if no exception is taken. 

Restrictions: 

The fields fr,fs,ft, and fd must specify FPRs valid for operands of type fmt, if they are not valid, the result is UNPRE- 
DICTABLE. 

The operands must be values in format fmt, if they are not, the result is UNPREDICTABLE and the value of the 
operand FPRs becomes UNPREDICTABLE. 

The result of MSUB.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 


Operation: 


vfr ValueFPR ( f r , 
vfs <r- ValueFPR(fs, 
vft <r- ValueFPR(ft, 
StoreFPR(fd, fmt, 


fmt ) 
fmt ) 
fmt ) 

(vfs X fmt vft) 


-fmt vfr) ) 
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Floating Point Multiply Subtract (cont.) 


MSUB.fmt 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Inexact, Unimplemented Operation, Invalid Operation, Overflow, Underflow 
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MSUBU 


Multiply and Subtract Word to Hi,Lo 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL2 



0 

0 

MSUBU 


rs 

rt 




011100 



00000 

00000 

000101 


6 5 

5 

5 

5 

6 

Format: msubu rs, rt 




MIPS32 

Purpose: 






To multiply two words and subtract the result from Hi, Lo 

Description: (hi,lo) <- (hi,lo) - (rs x rt) 

The 32-bit word value in GPR rs is multiplied by the 32-bit word value in GPR rt, treating both operands as 
values, to produce a 64-bit result. The product is subtracted from the 64-bit concatenated values of HI and 
most significant 32 bits of the result are written into HI and the least signficant 32 bits are written into LO. 
metic exception occurs under any circumstances. 

Restrictions: 

None 

This instruction does not provide the capability of writing directly to a target GPR. 

Operation: 

temp <- (HI | | LO) - (GPR [ rs ] X GPR[rt]) 

HI <- temp 63 _ _ 32 
LO <— temp 31 __ 0 

Exceptions: 

None 

Programming Notes: 

Where the size of the operands are known, software should place the shorter operand in GPR rt. This may reduce the 
latency of the instruction on those processors which implement data-dependent instruction latencies. 


unsigned 
LO.. The 
No arith- 
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Move to Coprocessor 0 MTCO 

31 26 25 21 20 16 15 11 10 3 2 0 


COPO 

MT 



0 




rt 

rd 


sel 

010000 

00100 



0000 000 



6 

5 

5 

5 

8 

3 

Format: MTCO rt, 

rd 




MIPS32 

MTCO rt, 

rd, sel 




MIPS32 

Purpose: 







To move the contents of a general register to a coprocessor 0 register. 

Description: cpr[ 0, rd, sel] rt 

The contents of general register rt are loaded into the coprocessor 0 register specified by the combination of rd and 
sel. Not all coprocessor 0 registers support the the sel field. In those instances, the sel field must be set to zero. 

Restrictions: 

The results are UNDEFINED if coprocessor 0 does not contain a register as specified by rd and sel. 

Operation: 

data <— GPR[rt] 

CPR [ 0 , rd, sel ] 4- data 

Exceptions: 

Coprocessor Unusable 
Reserved Instruction 
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MTC1 


Move Word to Floating Point 


31 26 25 21 20 16 15 11 10 0 


COP1 

MT 



0 



rt 

fs 


010001 

00100 



000 0000 0000 


6 5 

5 

5 

11 


Format: MTC1 rt, fs 




MIPS32 

Purpose: 






To copy a word from a GPR to an FPU (CPI) general register 

Description: fs <- rt 

The low word in GPR rt is placed into the low word of floating point (Coprocessor 1) general register fs. 

Restrictions: 

Operation: 

data <— GPR[rt] 31-- Q 

StoreFPR ( f S, UNINTERPRETED_WORD, data) 


Exceptions: 

Coprocessor Unusable 

Historical Information: 

For MIPS I, MIPS II, and MIPS III the value of FPR /i is UNPREDICTABLE for the instruction immediately follow- 
ing MTC1. 
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MTC2 


Move Word to Coprocessor 2 


31 26 25 21 20 16 15 11 10 8 7 0 


COP2 

010010 

MT 

00100 

rt 

Impl 

6 

5 

5 

16 

Format: MTC2 

rt , rd 


MIPS32 

MTC2 

rt, rd, sel 


MIPS32 


The syntax shown above is an example using MTC1 as a model. The specific syntax is implementation dependent. 

Purpose: 

To copy a word from a GPR to a COP2 general register 

Description: CP2CPR[impl] e- rt 

The low word in GPR rt is placed into the low word of coprocessor 2 general register denoted by the Impl field. The 
interpretation of the Impl field is left entirely to the Coprocessor 2 implementation and is not specified by the archi- 
tecture. 

Restrictions: 

The results are UNPREDICTABLE if Impl specifies a coprocessor 2 register that does not exist. 

Operation: 

data <— GPR [ rt ] 

CP2CPR [ Impl ] <— data 

Exceptions: 

Coprocessor Unusable 
Reserved Instruction 
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MTHC1 


Move Word to High Half of Floating Point Register 


31 26 25 21 20 16 15 11 10 0 


COP 1 

MTH 



0 



rt 

fs 


010001 

00111 



000 0000 0000 


6 5 

5 

5 

11 

Format: mthci rt, fs 



MIPS32 Release 2 

Purpose: 





To copy a word from a GPR to the high half of an FPU (CPI) general register 

Description: fs 63 32 <- rt 

The word in GPR rt is placed into the high word of floating point (Coprocessor 1) general register/?. This instruction 
is primarily intended to support 64-bit floating point units on a 32-bit CPU, but the semantics of the instruction are 
defined for all cases. 

Restrictions: 

In implementations prior to Release 2 of the architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

The results are UNPREDICTABLE if Status FR = 0 and/? is odd. 

Operation: 

newdata <- GPR [ rt ] olddata <- ValueFPR(fs, UNINTERPRETED_DOUBLEWORD ) 31 _ _ 0 
StoreFPR ( f s, UNINTERPRETED_DOUBLEWORD, newdata || olddata) 


Exceptions: 

Coprocessor Unusable 
Reserved Instruction 

Programming Notes 

When paired with MTC1 to write a value to a 64-bit FPR, the MTC1 must be executed first, followed by the MTHC1. 
This is because of the semantic definition of MTC1, which is not aware that software will be using an MTHC1 
instruction to complete the operation, and sets the upper half of the 64-bit FPR to an UNPREDICTABLE value. 
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Move Word to High Half of Coprocessor 2 Register 


MTHC2 
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16 



Format: MTHC 2 rt, rd MIPS32 Release 2 

mthc 2 rt, rd, sel MIPS32 Release 2 

The syntax shown above is an example using MTHC1 as a model. The specific syntax is implementation dependent. 

Purpose: 

To copy a word from a GPR to the high half of a COP2 general register 

Description: CP2CPR[Impl] 63 __ 32 <- rt 

The word in GPR rt is placed into the high word of coprocessor 2 general register denoted by the Impl field. The 
interpretation of the Impl field is left entirely to the Coprocessor 2 implementation and is not specified by the archi- 
tecture. 

Restrictions: 

The results are UNPREDICTABLE if Impl specifies a coprocessor 2 register that does not exist, or if that register is 
not 64 bits wide. 

In implementations prior to Release 2 of the architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

Operation: 

data <— GPR [ rt ] 

CP2CPR [ Impl ] 4— data || CPR [ 2 , rd, sel ] 31 .. 0 

Exceptions: 

Coprocessor Unusable 
Reserved Instruction 

Programming Notes 

When paired with MTC2 to write a value to a 64-bit CPR, the MTC2 must be executed first, followed by the 
MTHC2. This is because of the semantic definition of MTC2, which is not aware that software will be using an 
MTHC2 instruction to complete the operation, and sets the upper half of the 64-bit CPR to an UNPREDICTABLE 
value. 
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Move to HI Register 


MTHI 


31 26 25 


21 20 
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0 
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0 

000 0000 0000 0000 


MTHI 

010001 

6 

5 


15 


6 

Format: mthi rs 





MIPS32 

Purpose: 







To copy a GPR to the special purpose HI register 

Description: hi 4— rs 

The contents of GPR rs are loaded into special register HI. 

Restrictions: 

A computed result written to the HIILO pair by DIV, DIVU,MULT, or MULTU must be read by MFHI or MFLO 
before a new result can be written into either HI or LO. 

If an MTHI instruction is executed following one of these arithmetic instructions, but before an MFLO or MFHI 
instruction, the contents of LO are UNPREDICTABLE. The following example shows this illegal situation: 

MUL r2,r4 # start operation that will eventually write to HI,LO 

... # code not containing mfhi or mflo 

MTHI r 6 

... # code not containing mflo 

MFLO r3 # this mflo would get an UNPREDICTABLE value 

Operation: 

HI 4- GPR [ rs ] 

Exceptions: 

None 

Historical Information: 

In MIPS 1-ID, if either of the two preceding instructions is MFHI, the result of that MFHI is UNPREDICTABLE. 
Reads of the HI or LO special register must be separated from any subsequent instructions that write to them by two 
or more instructions. In MIPS IV and later, including MIPS32 and MIPS64, this restriction does not exist. 
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Move to LO Register 


MTLO 
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Format: mtlo rs 





MIPS32 

Purpose: 







To copy a GPR to the special purpose LO register 

Description: lo 4— rs 

The contents of GPR rs are loaded into special register LO. 

Restrictions: 

A computed result written to the HULO pair by DIV, DIVU, MULT, or MULTU must be read by MFHI or MFLO 
before a new result can be written into either HI or LO. 

If an MTLO instruction is executed following one of these arithmetic instructions, but before an MFLO or MFHI 
instruction, the contents of HI are UNPREDICTABLE. The following example shows this illegal situation: 

MUL r2,r4 # start operation that will eventually write to HI,LO 

... # code not containing mfhi or mflo 

MTLO r 6 

... # code not containing mfhi 

MFHI r3 # this mfhi would get an UNPREDICTABLE value 

Operation: 

LO 4- GPR [ rs ] 

Exceptions: 

None 

Historical Information: 

In MIPS 1-ID, if either of the two preceding instructions is MFHI, the result of that MFHI is UNPREDICTABLE. 
Reads of the HI or LO special register must be separated from any subsequent instructions that write to them by two 
or more instructions. In MIPS IV and later, including MIPS32 and MIPS64, this restriction does not exist. 
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MUL 


Multiply Word to GPR 
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6 

Format: mul rd, rs, rt 




MIPS32 

Purpose: 






To multiply two words and write the result to a GPR. 

Description: rd f- rs x rt 

The 32-bit word value in GPR rs is multiplied by the 32-bit value in GPR rt, treating both operands as signed values, 
to produce a 64-bit result. The least significant 32 bits of the product are written to GPR rd. The contents of HI and 
LO are UNPREDICTABLE after the operation. No arithmetic exception occurs under any circumstances. 

Restrictions: 

Note that this instruction does not provide the capability of writing the result to the HI and LO registers. 

Operation: 

temp <- GPR[rs] * GPR[rt] 

GPR[rd] < temp 31 __ 0 

HI <- UNPREDICTABLE 
LO <- UNPREDICTABLE 

Exceptions: 

None 

Programming Notes: 

In some processors the integer multiply operation may proceed asynchronously and allow other CPU instructions to 
execute before it is complete. An attempt to read GPR rd before the results are written interlocks until the results are 
ready. Asynchronous execution does not affect the program result, but offers an opportunity for performance 
improvement by scheduling the multiply so that other instructions can execute in parallel. 

Programs that require overflow detection must check for it explicitly. 

Where the size of the operands are known, software should place the shorter operand in GPR rt. This may reduce the 
latency of the instruction on those processors which implement data-dependent instruction latencies. 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


207 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 





Floating Point Multiply 


MUL.fmt 
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Format: mul.s fd, fs, ft 

MUL . D fd, fs, ft 
MUL.PS fd, fs, ft 


MIPS32 
MIPS32 
MIPS64 
MIPS32 Release 2 


Purpose: 

To multiply FP values 
Description: fd 4- fs x ft 

The value in FPR fs is multiplied by the value in FPR ft. The result is calculated to infinite precision, rounded accord- 
ing to the current rounding mode in FCSR , and placed into FPR fd. The operands and result are values in format fmt. 
MUL.PS multiplies the upper and lower halves of FPR/s and FPR/f independently, and ORs together any generated 
exceptional conditions. 

Restrictions: 

The fields fs,ft, and fd must specify FPRs valid for operands of type/mf; if they are not valid, the result is UNPRE- 
DICTABLE. 

The operands must be values in format fmt', if they are not, the result is UNPREDICTABLE and the value of the 
operand FPRs becomes UNPREDICTABLE. 

The result of MUL.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR (fd, fmt, ValueFPR(fs, fmt) X £mt ValueFPR(ft, fmt)) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Inexact, Unimplemented Operation, Invalid Operation, Overflow, Underflow 
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Multiply Word 


MULT 



655 10 6 


Format: mult rs, rt MIPS32 

Purpose: 

To multiply 32-bit signed integers 
Description: (hi, lo) 4- rs x rt 

The 32-bit word value in GPR rt is multiplied by the 32-bit value in GPR rs, treating both operands as signed values, 
to produce a 64-bit result. The low-order 32-bit word of the result is placed into special register LO, and the 
high-order 32-bit word is splaced into special register HI. 

No arithmetic exception occurs under any circumstances. 

Restrictions: 

None 

Operation: 

prod 4— GPR[rs] 31 _o x GPR [ r t ] 31 . . 0 
LO 4 — prod 31 _ _ o 

HI 4 - prod 63 _ _ 32 

Exceptions: 

None 

Programming Notes: 

In some processors the integer multiply operation may proceed asynchronously and allow other CPU instructions to 
execute before it is complete. An attempt to read LO or ///before the results are written interlocks until the results are 
ready. Asynchronous execution does not affect the program result, but offers an opportunity for performance 
improvement by scheduling the multiply so that other instructions can execute in parallel. 

Programs that require overflow detection must check for it explicitly. 

Where the size of the operands are known, software should place the shorter operand in GPR rt. This may reduce the 
latency of the instruction on those processors which implement data-dependent instruction latencies. 
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Multiply Unsigned Word 
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Format: multu rs, rt MIPS32 

Purpose: 

To multiply 32-bit unsigned integers 
Description: (Hi, lo) 4- rs x rt 

The 32-bit word value in GPR rt is multiplied by the 32-bit value in GPR rs, treating both operands as unsigned val- 
ues, to produce a 64-bit result. The low-order 32-bit word of the result is placed into special register LO, and the 
high-order 32-bit word is placed into special register HI. 

No arithmetic exception occurs under any circumstances. 

Restrictions: 

None 

Operation: 

procR— (0 I! GPR[rs] 31- -0 ) x (0 II GPR [rt] 31- _ 0 ) 

LO <— prod 31 _ _ q 
HI <- prod 63 _ _ 32 

Exceptions: 

None 

Programming Notes: 

In some processors the integer multiply operation may proceed asynchronously and allow other CPU instructions to 
execute before it is complete. An attempt to read LO or ///before the results are written interlocks until the results are 
ready. Asynchronous execution does not affect the program result, but offers an opportunity for performance 
improvement by scheduling the multiply so that other instructions can execute in parallel. 

Programs that require overflow detection must check for it explicitly. 

Where the size of the operands are known, software should place the shorter operand in GPR rt. This may reduce the 
latency of the instruction on those processors which implement data-dependent instruction latencies. 
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Floating Point Negate 


NEG.fmt 
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Format: neg.S fd, fs 

NEG.D fd, fs 
NEG.PS fd, fs 


MIPS32 
MIPS32 
MIPS64 
MIPS32 Release 2 


Purpose: 

To negate an FP value 

Description: fd < fs 

The value in FPR fs is negated and placed into FPR fd. The value is negated by changing the sign bit value. The oper- 
and and result are values in format fmt. NEG.PS negates the upper and lower halves of FPR fs independently, and 
ORs together any generated exceptional conditions. 

This operation is arithmetic; a NaN operand signals invalid operation. 

Restrictions: 

The fields fs and fd must specify FPRs valid for operands of type fmt; if they are not valid, the result is UNPRE- 
DICTABLE. The operand must be a value in format fmt; if it is not, the result is UNPREDICTABLE and the value 
of the operand FPR becomes UNPREDICTABLE. 

The result of NEG.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR(fd, fmt, Negate (ValueFPR ( fs , fmt))) 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation, Invalid Operation 
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Floating Point Negative Multiply Add NMADD.fmt 
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Format: nmadd.s fd, fr, fs, ft 

NMADD . D fd, fr, fs, ft 
NMADD.PS fd, fr, fs, ft 

I 

Purpose: 

To negate a combined multiply-then-add of FP values 

Description: fd «- - ((fs x ft) + fr) 

The value in FPR fs is multiplied by the value in FPR ft to produce an intermediate product. The value in FPR/r is 
added to the product. 

The result sum is calculated to infinite precision, rounded according to the current rounding mode in FCSR , negated 
by changing the sign bit, and placed into FPR/d. The operands and result are values in format fmt. 

I NMADD.PS applies the operation to the upper and lower halves of FPR/r, FPR fs, and FPR ft independently, and 
ORs together any generated exceptional conditions. 

Cause bits are ORed into the Flag bits if no exception is taken. 

Restrictions: 

The fields fr,fs,ft, and fd must specify FPRs valid for operands of type fmt, if they are not valid, the result is UNPRE- 
DICTABLE. 

The operands must be values in format fmt; if they are not, the result is UNPREDICTABLE and the value of the 
operand FPRs becomes UNPREDICTABLE. 

| The result of NMADD.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 


MIPS64 
MIPS64 
MIPS64 
MIPS32 Release 2 


Operation: 

vfr <— ValueFPR(fr, fmt) 
vfs ValueFPR(fs, fmt) 
vft <r- ValueFPR(ft, fmt) 

StoreFPR(fd, fmt, —(vfr + fmt (vfs X fmt vft))) 


212 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




Floating Point Negative Multiply Add (cont.) 


NMADD.fmt 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Inexact, Unimplemented Operation, Invalid Operation, Overflow, Underflow 
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Floating Point Negative Multiply Subtract 


NMSUB.fmt 
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Format: nmsub. s fd. 

fr, 

fs. 

ft 

MIPS64 

NMSUB. D fd, 

fr, 

fs. 

ft 

MIPS64 

NMSUB.PS fd, 

fr, 

fs, 

r ft 

MIPS64 


MIPS32 Release 2 


Purpose: 

To negate a combined multiply-then-subtract of FP values 

Description: fd <- - ( (fs x ft) - fr) 

The value in FPR fs is multiplied by the value in FPR ft to produce an intermediate product. The value in FPR fr is 
subtracted from the product. 

The result is calculated to infinite precision, rounded according to the current rounding mode in FCSR , negated by 
changing the sign bit, and placed into FPR /d The operands and result are values in format fint. 

NMSUB.PS applies the operation to the upper and lower halves of FPR fr, FPR/s, and FPR ft independently, and 
ORs together any generated exceptional conditions. 

Cause bits are ORed into the Flag bits if no exception is taken. 

Restrictions: 

The fields fr,fs,ft, and fd must specify FPRs valid for operands of type/mf; if they are not valid, the result is UNPRE- 
DICTABLE. 

The operands must be values in format fmt, if they are not, the result is UNPREDICTABLE and the value of the 
operand FPRs becomes UNPREDICTABLE. 

The result ofNMSUB.PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 


Operation: 

vfr ValueFPR ( f r , 
vfs <— ValueFPR(fs, 
vft <r- ValueFPR(ft, 
StoreFPR(fd, fmt, — 


fmt ) 
fmt ) 
fmt ) 

((vfs X fmt vft) - fmt vfr)) 
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Floating Point Negative Multiply Subtract (cont.) 


NMSUB.fmt 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Inexact, Unimplemented Operation, Invalid Operation, Overflow, Underflow 
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No Operation NOP 
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Format: nop 
Purpose: 

To perform no operation. 

Description: 




Assembly Idiom 


NOP is the assembly idiom used to denote no operation. The actual instruction is interpreted by the hardware as SLL 
rO, rO, 0. 

Restrictions: 

None 

Operation: 

None 

Exceptions: 

None 

Programming Notes: 

The zero instruction word, which represents SLL, rO, rO, 0, is the preferred NOP for software to use to fill branch and 
jump delay slots and to pad out alignment sequences. 
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NOR 


Not Or 
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Format: NOR rd, rs, rt 

Purpose: 

To do a bitwise logical NOT OR 
Description: rd 4— rs nor rt 




MIPS32 


The contents of GPR rs are combined with the contents of GPR rt in a bitwise logical NOR operation. The result is 
placed into GPR rd. 

Restrictions: 

None 

Operation: 

GPR [ rd] <- GPR [ rs ] nor GPR[rt] 

Exceptions: 

None 
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Or 


OR 
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Format: OR rd, rs, rt 

Purpose: 

To do a bitwise logical OR 

Description: rd 4— rs or rt 




MIPS32 


The contents of GPR rs are combined with the contents of GPR rt in a bitwise logical OR operation. The result is 
placed into GPR rd. 

Restrictions: 

None 

Operation: 

GPR [ rd] <- GPR [ rs ] or GPR[rt] 

Exceptions: 

None 
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Or Immediate 
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Format: ORI rt, rs, immediate MIPS32 

Purpose: 

To do a bitwise logical OR with a constant 

Description: rt 4— rs or immediate 

The 16-bit immediate is zero-extended to the left and combined with the contents of GPR rs in a bitwise logical OR 
operation. The result is placed into GPR rt. 

Restrictions: 

None 

Operation: 

GPR[rt] 4— GPRfrs] or zero_extend (immediate) 

Exceptions: 

None 
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Pair Lower Lower 
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Format: pll.ps fd, fs, ft MIPS64 

MIPS32 Release 2 

Purpose: 

To merge a pair of paired single values with realignment 

Description: fd lower (fs) | | lower (ft) 

A new paired-single value is formed by catenating the lower single of fs (bits 31. .0) and the lower single of ft (bits 
31. .0). 

The move is non-arithmetic; it causes no IEEE 754 exceptions. 

Restrictions: 

The fields fs, ft, and fd must specify FPRs valid for operands of type PS. If they are not valid, the result is UNPRE- 
DICTABLE. 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR ( fd, PS, ValueFPR(fs, PS ) 31 _ 0 II ValueFPR(ft, PS) 31 .. 0 ) 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 
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Pair Lower Upper 


PLU.PS 
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Format: plu.ps fd, fs, ft MIPS64 


MIPS32 Release 2 

Purpose: 

To merge a pair of paired single values with realignment 

Description: fd lower (fs) | | upper (ft) 

A new paired-single value is formed by catenating the lower single of fs (bits 31..0) and the upper single of ft (bits 
63.32). 

The move is non-arithmetic; it causes no IEEE 754 exceptions. 

Restrictions: 

The fields fs, ft , and fd must specify FPRs valid for operands of type PS. If they are not valid, the result is UNPRE- 
DICTABLE. 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR ( fd, PS, ValueFPR(fs, PS ) 31--0 |[ ValueFPR(ft, PS) 63 .. 32 ) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 
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Prefetch 


PREF 


31 26 25 21 20 16 15 


0 


PREF 

110011 


base 


hint 


offset 


6 


5 


5 


16 


Format: PREF hint, off set (base) MIPS32 

Purpose: 

To move data between memory and cache. 

Description: prefetch_memory (base+of fset) 

PREF adds the 16-bit signed offset to the contents of GPR base to form an effective byte address. The hint field sup- 
plies information about the way that the data is expected to be used. 

PREF enables the processor to take some action, typically prefetching the data into cache, to improve program perfor- 
mance. The action taken for a specific PREF instruction is both system and context dependent. Any action, including 
doing nothing, is permitted as long as it does not change architecturally visible state or alter the meaning of a pro- 
gram. Implementations are expected either to do nothing, or to take an action that increases the performance of the 
program. The PrepareForStore function is unique in that it may modify the architecturally visible state. 

PREF does not cause addressing-related exceptions. If the address specified would cause an addressing exception, the 
exception condition is ignored and no data movement occurs. However even if no data is prefetched, some action that 
is not architecturally visible, such as writeback of a dirty cache line, can take place. 

It is implementation dependent whether a Bus Error or Cache Error exception is reported if such an error is detected 
as a byproduct of the action taken by the PREF instruction. 

PREF never generates a memory operation for a location with an uncached memory access type. 

If PREF results in a memory operation, the memory access type used for the operation is determined by the memory 
access type of the effective address, just as it would be if the memory operation had been caused by a load or store to 
the effective address. 

For a cached location, the expected and useful action for the processor is to prefetch a block of data that includes the 
effective address. The size of the block and the level of the memory hierarchy it is fetched into are implementation 
specific. 

The hint field supplies information about the way the data is expected to be used. With the exception of PrepareFor- 
Store, a hint value cannot cause an action to modify architecturally visible state. A processor may use a hint value to 
improve the effectiveness of the prefetch action. 
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Prefetch (cont.) 


PREF 


Table 3-30 Values of the hint Field for the PREF Instruction 


Value 

Name 

Data Use and Desired Prefetch Action 

0 

load 

Use: Prefetched data is expected to be read (not modified). 
Action: Fetch data as if for a load. 

1 

store 

Use: Prefetched data is expected to be stored or modified. 
Action: Fetch data as if for a store. 

2-3 

Reserved 

Reserved for future use - not available to implementations. 

4 

load_streamed 

Use: Prefetched data is expected to be read (not modified) but not 
reused extensively; it “streams” through cache. 

Action: Fetch data as if for a load and place it in the cache so that it 
does not displace data prefetched as “retained.” 

5 

store_streamed 

Use: Prefetched data is expected to be stored or modified but not 
reused extensively; it “streams” through cache. 

Action: Fetch data as if for a store and place it in the cache so that 
it does not displace data prefetched as “retained.” 

6 

load_retained 

Use: Prefetched data is expected to be read (not modified) and 
reused extensively; it should be “retained” in the cache. 

Action: Fetch data as if for a load and place it in the cache so that it 
is not displaced by data prefetched as “streamed.” 

7 

store_retained 

Use: Prefetched data is expected to be stored or modified and reused 
extensively; it should be “retained” in the cache. 

Action: Fetch data as if for a store and place it in the cache so that 
it is not displaced by data prefetched as “streamed.” 
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Table 3-30 Values of the hint Field for the PREF Instruction 


8-24 

Reserved 

Reserved for future use - not available to implementations. 

25 

writeback_invalidate 
(also known as “nudge”) 

Use: Data is no longer expected to be used. 

Action: For a writeback cache, schedule a wirteback of any dirty 
data. At the completion of the writeback, mark the state of any cache 
lines written back as invalid. 

26-29 

Implementation 

Dependent 

Unassigned by the Architecture - available for 
implementation-dependent use. 



Use: Prepare the cache for writing an entire line, without the 
overhead involved in filling the line from memory. 

30 

PrepareForStore 

Action: If the reference hits in the cache, no action is taken. If the 
reference misses in the cache, a line is selected for replacement, any 
valid and dirty victim is written back to memory, the entire line is 
filled with zero data, and the state of the line is marked as valid and 
dirty. 



Programming Note: Because the cache line is filled with zero data 
on a cache miss, software must not assume that this action, in and 
of itself, can be used as a fast bzero-type function. 

31 

Implementation 

Dependent 

Unassigned by the Architecture - available for 
implementation-dependent use. 
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Prefetch (cont.) 


PREF 


Restrictions: 

None 

Operation: 

vAddr <r- GPR[base] + sign_extend (off set) 

(pAddr, CCA) <— AddressTranslation (vAddr, DATA, LOAD) 

Prefetch (CCA, pAddr, vAddr, DATA, hint) 

Exceptions: 

Bus Error, Cache Error 

Prefetch does not take any TLB-related or address-related exceptions under any circumstances. 

Programming Notes: 

Prefetch cannot prefetch data from a mapped location unless the translation for that location is present in the TLB. 
Locations in memory pages that have not been accessed recently may not have translations in the TLB, so prefetch 
may not be effective for such locations. 

Prefetch does not cause addressing exceptions. It does not cause an exception to prefetch using an address pointer 
value before the validity of a pointer is determined. 

It is implementation dependent whether a Bus Error or Cache Error exception is reported if such an error is detected 
as a byproduct of the action taken by the PREL instruction. Typically, this only occurs in systems which have 
high-reliability requirements. 

Prefetch operations have no effect on cache lines that were previously locked with the CACHE instruction. 

Hint field encodings whose function is described as “streamed” or “retained” convey usage intent from software to 
hardware. Software should not assume that hardware will always prefetch data in an optimal way. If data is to be truly 
retained, software should use the Cache instruction to lock data into the cache. 
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Prefetch Indexed 


PREFX 


31 26 25 21 20 16 15 11 10 6 5 0 


COP IX 




0 

PREFX 


base 

index 

hint 



010011 




00000 

001111 


6 5 5 5 5 6 


Format: PREFX hint, index (base) MIPS64 

MIPS32 Release 2 

Purpose: 

To move data between memory and cache. 

Description: prefetch_memory [base + index] 

PREFX adds the contents of GPR index to the contents of GPR base to form an effective byte address. The hint field 
supplies information about the way the data is expected to be used. 

The only functional difference between the PREF and PREFX instructions is the addressing mode implemented by 
the two. Refer to the PREF instruction for all other details, including the encoding of the hint field. 

Restrictions: 


Operation: 

vAddr 4— GPR [base] + GPR [index] 

(pAddr, CCA) 4— AddressTranslation (vAddr, DATA, LOAD) 

Prefetch (CCA, pAddr, vAddr, DATA, hint) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction, Bus Error, Cache Error 

Programming Notes: 

The PREFX instruction is only available on processors that implement floating point and should never by generated 
by compilers in situations in which the corresponding load and store indexed floating point instructions are generated. 

Also refer to the corresponding section in the PREF instruction description. 
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Pair Upper Lower 


PUL. PS 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 

fmt 




PUL 



ft 

fs 

fd 


010001 

10110 




101110 


6 5 5 5 5 6 

Format: pul. PS fd, fs, ft MIPS64 


MIPS32 Release 2 

Purpose: 

To merge a pair of paired single values with realignment 

Description: fd upper (fs) | | lower (ft) 

A new paired-single value is formed by catenating the upper single of fs (bits 63.32) and the lower single of ft (bits 
31. .0). 

The move is non-arithmetic; it causes no IEEE 754 exceptions. 

Restrictions: 

The fields fs, ft, and fd must specify FPRs valid for operands of type PS. If they are not valid, the result is UNPRE- 
DICTABLE. 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR ( fd, PS, ValueFPR(fs, PS ) 63--32 II ValueFPR(ft, PS) 31--0 ) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 
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Pair Upper Upper 


PUU.PS 


228 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 

fmt 




PUU 



ft 

fs 

fd 


010001 

10110 




101111 


6 5 

5 

5 

5 

6 

Format: PUU.PS fd, fs, ft 




MIPS64 





MIPS32 Release 2 

Purpose: 






To merge a pair of paired single values with realignment 

Description: fd <- upper (fs) | | upper (ft) 

A new paired-single value is formed by catenating the upper single of fs (bits 63.32) and the upper single of ft (bits 
63.32). 

The move is non-arithmetic; it causes no IEEE 754 exceptions. 

Restrictions: 

The fields fs, ft, and fd must specify FPRs valid for operands of type PS. If they are not valid, the result is UNPRE- 
DICTABLE. 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR ( fd, PS, ValueFPR(fs, PS ) 63 _ 32 II ValueFPR(ft, PS) 63 .. 32 ) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 
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RDHWR 


Read Hardware Register 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL3 

0 



0 

RDHWR 



rt 

rd 



0111 11 

00 000 



000 00 

11 1011 


6 5 5 5 2 3 6 


Format: rdhwr rt,rd 


MIPS32 Release 2 


Purpose: 

To move the contents of a hardware register to a general purpose register (GPR) if that operation is enabled by privi- 
leged software. 


Description: rt «- HWR[rd] 

If access is allowed to the specified hardware register, the contents of the register specified by rd is loaded into gen- 
eral register rt. Access control for each register is selected by the bits in the coprocessor 0 HWREna register. 

The available hardware registers, and the encoding of the rd field for each, are shown in Table 3-31. 


Table 3-31 Hardware Register List 


Register Number 
(rd Value) 


Register 

Name 


Contents 


0 


1 


2 


CPUNum 


Number of the CPU on which the program is currently running. 
This comes directly from the coprocessor 0 EBase CPUNum field. 


SYNCI_Step 


Address step size to be used with the SYNCI instruction. See that 
instruction’s description for the use of this value. 


High-resolution cycle counter. This comes directly from the 
coprocessor 0 Count register. 


Resolution of the CC register. This value denotes the number of 
cycles between update of the register. For example: 


3 


All others 


CCRes Value 

Meaning 

1 

CC register increments every CPU cycle 

2 

CC register increments every second CPU cycle 

3 

CC register increments every third CPU cycle 

etc. 


Access results in a Reserved Instruction Exception 
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RDHWR 


Read Hardware Register, cont. 


Restrictions: 

In implementations of Release 1 of the Architecture, this instruction resulted in a Reserved Instruction Exception. 

Operation: 

case rd 

16#00: temp 4— EBase CPUNum 
16#01: temp 4— SYNCI_StepSize ( ) 

16#02: temp 4— Count 

16#03: temp 4— CountResolution ( ) 
otherwise : SignalException (Reservedlnstruction) 
endcase 

GPR[rt] 4— temp 


Exceptions: 

Reserved Instruction 
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RDPGPR 


Read GPR from Previous Shadow Set 


31 26 25 21 20 16 15 11 10 0 


COPO 
0100 00 


RDPGPR 
01 010 

rt 

rd 

0 

000 0000 0000 

6 

Format: 

Purpose: 

5 

RDPGPR rd, rt 

5 

5 

11 

MIPS32 Release 2 


To move the contents of a GPR from the previous shadow set to a current GPR. 

Description: rd 4- SGPR[SRSCti PSS , rt] 

The contents of the shadow GPR register specified by SRSCtlpgg (signifying the previous shadow set number) and rt 
(specifying the register number within that set) is moved to the current GPR rd. 

Restrictions: 

In implementations prior to Release 2 of the Architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

Operation: 

GPR [ rd] <- SGPR[SRSCtl PSS , rt ] 

Exceptions: 

Coprocessor Unusable 
Reserved Instruction 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


231 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




RECIP.fmt 


Reciprocal Approximation 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 


0 



RECIP 


fmt 


fs 

fd 


010001 


00000 



010101 


6 

5 

5 

5 

5 

6 

Format: recip. s 

RECIP .D 

fd, f s 
fd, f s 




MIPS64 
MIPS32 Release 2 
MIPS64 
MIPS32 Release 2 


Purpose: 

To approximate the reciprocal of an FP value (quickly) 

Description: fd f- 1.0 / fs 

The reciprocal of the value in FPR fs is approximated and placed into FPR fd. The operand and result are values in 
format fmt. 

The numeric accuracy of this operation is implementation dependent; it does not meet the accuracy specified by the 
IEEE 754 Floating Point standard. The computed result differs from the both the exact result and the IEEE-mandated 
representation of the exact result by no more than one unit in the least-significant place (ULP). 

It is implementation dependent whether the result is affected by the current rounding mode in FCSR. 

Restrictions: 

The fields fs and fd must specify FPRs valid for operands of type fmt\ if they are not valid, the result is UNPRE- 
DICTABLE. 

The operand must be a value in format fmt; if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of RECIPD is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR(fd, fmt, 1.0 / valueFPR(fs, fmt)) 
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Reciprocal Approximation (cont.) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Inexact, Division-by-zero, Unimplemented Op, Invalid Op, Overflow, Underflow 
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ROTR 


Rotate Word Right 


31 26 25 22 21 20 16 15 11 10 6 5 0 


SPECIAL 

0000 

R 

1 

rt 

rd 

sa 

SRL 

000000 





000010 


6 4 1 

5 

5 

5 

6 

Format: rotr rd, rt, sa 




SmartMIPS Crypto 





MIPS32 Release 2 

Purpose: 






To execute a logical right-rotate of a word by a fixed number of bits 
Description: rd 4— rt <-> (right) sa 

The contents of the low-order 32-bit word of GPR rt are rotated right; the word result is placed in GPR rd. The 
bit-rotate amount is specified by sa. 

Restrictions: 

Operation: 

if ( (ArchitectureRevision ( ) < 2) and (Config3 SM = 0)) then 

UNPREDICTABLE 

endif 

s 4- sa 

temp 4— GPR [rt ] s _ x _ _ 0 II GPR[rt] 3 1--s 
GPR[rd]4— temp 

Exceptions: 

Reserved Instruction 
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Rotate Word Right Variable 


ROTRV 


31 26 25 21 20 16 15 11 10 7 6 5 0 


SPECIAL 

rs 

rt 

rd 

0000 

R 

1 

SRLV 

000000 





000110 


6 5 

5 

5 

4 

1 6 

Format: rotrv rd, rt, rs 




SmartMIPS Crypto 





MIPS32 Release 2 

Purpose: 






To execute a logical right-rotate of a word by a variable number of bits 
Description: rd <— rt <-> (right) rs 

The contents of the low-order 32-bit word of GPR rt are rotated right; the word result is placed in GPR rd. The 
bit-rotate amount is specified by the low-order 5 bits of GPR rs. 

Restrictions: 

Operation: 

if ( (ArchitectureRevision ( ) < 2) and (Config3 SM = 0)) then 

UNPREDICTABLE 

endif 

s <— GPR [ rs ] 4 _o 

temp <— GPR [rt ] s _ x _ _ o II GPR[rt] 3 1--s 

GPR[rd]<— temp 

Exceptions: 

Reserved Instruction 
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Floating Point Round to Long Fixed Point 


ROUND.L.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 


0 



ROUND. L 


fmt 


fs 

fd 


010001 


00000 



001000 


6 5 5 5 5 6 


Format: round . l . s 

fd, f s 

MIPS64 
MIPS32 Release 2 

ROUND . L . D 

fd, f s 

MIPS64 
MIPS32 Release 2 


Purpose: 

To convert an FP value to 64-bit fixed point, rounding to nearest 

Description: fd convert_and_round (fs) 

The value in FPR fs, in format fmt, is converted to a value in 64-bit long fixed point format and rounded to near- 
est/even (rounding mode 0). The result is placed in FPR fd. 

When the source value is Infinity, NaN, or rounds to an integer outside the range -2 63 to 2 63 -l, the result cannot be 
represented correctly and an IEEE Invalid Operation condition exists. In this case the Invalid Operation flag is set in 
the FCSR. If the Invalid Operation Enable bit is set in the FCSR , no result is written to fd and an Invalid Operation 
exception is taken immediately. Otherwise, the default result, 2 63 -l, is written to fd. 

Restrictions: 

The fields/? and fd must specify valid FPRs;/,v for type fmt and fd for long fixed point; if they are not valid, the result 

is UNPREDICTABLE. 

The operand must be a value in format fmt, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR(fd, L, ConvertFmt (ValueFPR ( f s , fmt), fmt, L) ) 
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Floating Point Round to Long Fixed Point (cont.) 


ROUND.L.fmt 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Inexact, Unimplemented Operation, Invalid Operation, Overflow 
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Floating Point Round to Word Fixed Point 


ROUND. W.fmt 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 


0 



ROUND .W 


fmt 


fs 

fd 


010001 


00000 



001100 


6 5 5 5 5 6 


Format: round . w . s 

fd. 

f s 

MIPS32 

ROUND . W . D 

fd, 

f s 

MIPS32 


Purpose: 

To convert an FP value to 32-bit fixed point, rounding to nearest 

Description: fd convert_and_round (fs) 

The value in FPR/s, in format fmt, is converted to a value in 32-bit word fixed point format rounding to nearest/even 
(rounding mode 0). The result is placed in FPR fd. 

o i o 1 

When the source value is Infinity, NaN, or rounds to an integer outside the range -2 to 2 -1, the result cannot be 
represented correctly and an IEEE Invalid Operation condition exists. In this case the Invalid Operation flag is set in 
the FCSR. If the Invalid Operation Enable bit is set in the FCSR , no result is written to fd and an Invalid Operation 

• • • Q 1 

exception is taken immediately. Otherwise, the default result, 2 —1, is written to fd. 

Restrictions: 

The fields/s and /d must specify valid FPRs; /,v for typ efmt and fd for word fixed point; if they are not valid, the result 

is UNPREDICTABLE. 

The operand must be a value in format fmt', if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

Operation: 

StoreFPR(fd, W, ConvertFmt (ValueFPR ( f s , fmt), fmt, W) ) 
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Floating Point Round to Word Fixed Point (cont). 


ROUND. W.fmt 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Inexact, Unimplemented Operation, Invalid Operation, Overflow 
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RSQRT.fmt 


Reciprocal Square Root Approximation 


31 26 25 21 20 16 15 11 10 6 5 0 


COP1 

fmt 

0 

fs 

fd 

RSQRT 

010001 


00000 



010110 


6 


5 

5 

5 

5 

6 

Format: 

RSQRT . S 
RSQRT.D 

fd, f s 
fd, f s 




MIPS64 
MIPS32 Release 2 
MIPS64 
MIPS32 Release 2 


Purpose: 

To approximate the reciprocal of the square root of an FP value (quickly) 

Description: fd 4- l.o / sqrt(fs) 

The reciprocal of the positive square root of the value in FPR fs is approximated and placed into FPR/r/. The operand 
and result are values in format 

The numeric accuracy of this operation is implementation dependent; it does not meet the accuracy specified by the 
IEEE 754 Floating Point standard. The computed result differs from both the exact result and the IEEE-mandated 
representation of the exact result by no more than two units in the least-significant place (ULP). 

The effect of the current FCSR rounding mode on the result is implementation dependent. 

Restrictions: 

The fields fs and fd must specify FPRs valid for operands of type fmt, if they are not valid, the result is UNPRE- 
DICTABLE. 

The operand must be a value in format fmt, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of RSQRT.D is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR(fd, fmt, 1.0 / SquareRoot ( valueFPR ( f s , fmt))) 
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RSQRT.fmt 


Reciprocal Square Root Approximation (cont.) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Inexact, Division-by-zero, Unimplemented Operation, Invalid Operation, Overflow, Underflow 
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26 25 


21 20 


16 15 


Store Byte 


SB 


31 


0 


SB 

101000 


base 


rt 


offset 


6 


5 


5 


16 


Format: SB rt, offset (base) MIPS32 

Purpose: 

To store a byte to memory 

Description: memory [basetoff set ] 4— rt 

The least-significant 8-bit byte of GPR rt is stored in memory at the location specified by the effective address. The 
16-bit signed offset is added to the contents of GPR base to form the effective address. 

Restrictions: 

None 

Operation: 

vAddr 4— sign_extend (off set) + GPRfbase] 

(pAddr, CCA ) 4— AddressTranslation (vAddr, DATA, STORE) 

2 

pAddr 4— pAddr PSIZE _ 1 . . 2 I 1 (pAddr^ 0 xor ReverseEndian ) 

*2 

bytesel 4— vAddr x 0 xor BigEndianCPU 
dataword 4- GPR [ rt ] 31 - 8 *by tesel . . 0 II 0 8 * bytesel 
StoreMemory (CCA, BYTE, dataword, pAddr, vAddr, DATA) 

Exceptions: 

TLB Refill, TLB Invalid, TLB Modified, Bus Error, Address Error, Watch 
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Store Conditional Word 


SC 


31 26 25 21 20 16 15 


0 


SC 

111000 


base 


rt 


offset 


6 


5 


5 


16 


Format: SC rt, offset (base) MIPS32 

Purpose: 

To store a word to memory to complete an atomic read-modify-write 

Description: if atomic_update then memory [basetoff set ] 4— rt, rt 4— 1 else rt 4— 0 

The LL and SC instructions provide primitives to implement atomic read-modify-write (RMW) operations for syn- 
chronizable memory locations. 

The 32-bit word in GPR rt is conditionally stored in memory at the location specified by the aligned effective address. 
The 16-bit signed offset is added to the contents of GPR base to form an effective address. 

The SC completes the RMW sequence begun by the preceding LL instruction executed on the processor. To complete 
the RMW sequence atomically, the following occur: 

• The 32-bit word of GPR rt is stored into memory at the location specified by the aligned effective address. 

• A 1, indicating success, is written into GPR rt. 

Otherwise, memory is not modified and a 0, indicating failure, is written into GPR rt. 

If either of the following events occurs between the execution of LL and SC, the SC fails: 

• A coherent store is completed by another processor or coherent I/O module into the block of synchronizable 
physical memory containing the word. The size and alignment of the block is implementation dependent, but it 
is at least one word and at most the minimum page size. 

• An ERET instruction is executed. 

If either of the following events occurs between the execution of LL and SC, the SC may succeed or it may fail; the 
success or failure is not predictable. Portable programs should not cause one of these events. 

• A memory access instruction (load, store, or prefetch) is executed on the processor executing the LL/SC. 

• The instructions executed starting with the LL and ending with the SC do not lie in a 2048-byte contiguous 
region of virtual memory. (The region does not have to be aligned, other than the alignment required for 
instruction words.) 

The following conditions must be true or the result of the SC is UNPREDICTABLE: 

• Execution of SC must have been preceded by execution of an LL instruction. 

• An RMW sequence executed without intervening events that would cause the SC to fail must use the same 
address in the LL and SC. The address is the same if the virtual address, physical address, and cache-coherence 
algorithm are identical. 
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Store Conditional Word (cont.) 


SC 


Atomic RMW is provided only for synchronizable memory locations. A synchronizable memory location is one that 
is associated with the state and logic necessary to implement the LL/SC semantics. Whether a memory location is 
synchronizable depends on the processor and system configurations, and on the memory access type used for the 
location: 

• Uniprocessor atomicity: To provide atomic RMW on a single processor, all accesses to the location must be 
made with memory access type of either cached noncoherent or cached coherent. All accesses must be to one or 
the other access type, and they may not be mixed. 

• MP atomicity: To provide atomic RMW among multiple processors, all accesses to the location must be made 
with a memory access type of cached coherent. 

• I/O System: To provide atomic RMW with a coherent I/O system, all accesses to the location must be made 
with a memory access type of cached coherent. If the I/O system does not use coherent memory operations, then 
atomic RMW cannot be provided with respect to the I/O reads and writes. 

Restrictions: 

The addressed location must have a memory access type of cached noncoherent or cached coherent; if it does not, the 
result is UNPREDICTABLE. 

The effective address must be naturally-aligned. If either of the 2 least-significant bits of the address is non-zero, an 
Address Error exception occurs. 

Operation: 

vAddr <— sign_extend (off set) + GPRfbase] 
if vAddr-L q ^ 0 2 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) AddressTranslation (vAddr, DATA, STORE) 
dataworde- GPRfrt] 
if LLbit then 

StoreMemory (CCA, WORD, dataword, pAddr, vAddr, DATA) 
endif 

GPR [ rt ] <— 0 31 [ | LLbit 
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Store Conditional Word (cont.) 


SC 


Exceptions: 

TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch 

Programming Notes: 

LL and SC are used to atomically update memory locations, as shown below. 


LI : 


LL 

Tl, 

(TO) 

# 

load counter 



ADD I 

T2 , 

Tl, 1 

# 

increment 



SC 

T2 , 

(TO) 

# 

try to store, 

checking 

for atomicity 

BEQ 

T2 , 

0, LI 

# 

if not atomic 

(0), try 

again 

NOP 



# 

b ranch -del ay 

slot 



Exceptions between the LL and SC cause SC to fail, so persistent exceptions must be avoided. Some examples of 
these are arithmetic operations that trap, system calls, and floating point operations that trap or require software emu- 
lation assistance. 

LL and SC function on a single processor for cached noncoherent memory so that parallel programs can be run on 
uniprocessor systems that do not support cached coherent memory access types. 
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Software Debug Breakpoint 


SDBBP 


31 26 25 6 5 0 


SPECIAL2 


SDBBP 


code 


Of 1100 


linn 


6 20 6 


Format: SDBBP code EJTAG 

Purpose: 

To cause a debug breakpoint exception 

Description: 

This instruction causes a debug exception, passing control to the debug exception handler. If the processor is execut- 
ing in Debug Mode when the SDBBP instruction is executedthe exception is a Debug Mode Exception, which sets 
the Debug DExc c 0( j e field to the value 0x9 (Bp). The code field can be used for passing information to the debug 
exception handler, and is retrieved by the debug exception handler only by loading the contents of the memory word 
containing the instruction, using the DEPC register. The CODE held is not used in any way by the hardware. 

Restrictions: 

Operation: 

If Debug DM = 0 then 

SignalDebugBreakpointException ( ) 
else 

SignalDebugModeBreakpointException ( ) 
endif 

Exceptions: 

Debug Breakpoint Exception 
Debug Mode Breakpoint Exception 
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SDC1 


Store Doubleword from Floating Point 
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Format: 

SDC1 

ft, 

offset (base) 



MIPS32 

Purpose: 









To store a doubleword from an FPR to memory 

Description: memory [base+off set ] 4— ft 

The 64-bit doubleword in FPR /f is stored in memory at the location specified by the aligned effective address. The 
16-bit signed offset is added to the contents of GPR base to form the effective address. 

Restrictions: 

An Address Error exception occurs if EffectiveAddresso () A 0 (not doubleword-aligned). 

Operation: 

vAddr 4— sign_extend (off set) + GPR [base] 
if vAddr 2 o ^ 0 3 then 

SignalException (AddressError) 
endif 

(pAddr, CCA) 4 — AddressTranslation (vAddr, DATA, STORE) 
lsw 4 - ValueFPR ( ft , UNINTERPRETED_WORD) 
msw 4 - ValueFPR (ft+1, UNINTERP RETED_WORD ) 

paddr 4 — paddr xor ( (BigEndianCPU xor ReverseEndian) || 0 2 ) 
paddr 4 — paddr xor 2#100 


Exceptions: 

Coprocessor Unusable, Reserved Instruction, TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


247 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




Store Doubleword from Coprocessor 2 
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Format: 

SDC2 

rt, 

offset (base) 



MIPS32 

Purpose: 









To store a doubleword from a Coprocessor 2 register to memory 


Description: memory [base+off set ] 4— rt 

The 64-bit doubleword in Coprocessor 2 register rt is stored in memory at the location specified by the aligned effec- 
tive address. The 16-bit signed offset is added to the contents of GPR base to form the effective address. 

Restrictions: 

An Address Error exception occurs if EffectiveAddresso () A 0 (not doubleword-aligned). 

Operation: 

vAddr 4— sign_extend (off set) + GPR[base] 
if vAddr 2 q f 0 3 then 

SignalException (AddressError) 
endif 

(pAddr, CCA) 4 — AddressTranslation (vAddr, DATA, STORE) 
lsw 4 — CPR[2,rt,0] 
msw 4 — CPR [2, rt+1 , 0 ] 

paddr 4 — paddr xor ( (BigEndianCPU xor ReverseEndian) || 0 2 ) 
paddr 4 — paddr xor 2#100 


Exceptions: 

Coprocessor Unusable, Reserved Instruction, TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch 
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Store Doubleword Indexed from Floating Point 
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Format: SDXC1 fs, index (base) 




MIPS64 





MIPS32 Release 2 

Purpose: 






To store a doubleword from an FPR to memory (GPR+GPR addressing) 

Description: memory [base+index] 4— fs 

The 64-bit doubleword in FPR /i is stored in memory at the location specified by the aligned effective address. The 
contents of GPR index and GPR base are added to form the effective address. 

Restrictions: 

An Address Error exception occurs if EffectiveAddresso () A () (not doubleword-aligned). 


Operation: 

vAddr 4 — GPR [base] + GPR [index] 
if vAddr 2 q A 0 3 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) 4 — AddressTranslation (vAddr, DATA, STORE) 
lsw 4 - ValueFPR ( ft , UNINTERP RETED_WORD ) 
msw 4 - ValueFPR (ft+1, UNINTERP RETED_WORD ) 

paddr 4 — paddr xor ( (BigEndianCPU xor ReverseEndian) || 0 2 ) 
paddr 4 — paddr xor 2#100 


Exceptions: 

TLB Refill, TLB Invalid, TLB Modified, Coprocessor Unusable, Address Error, Reserved Instruction, Watch. 
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Sign-Extend Byte 
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Format: seb rd, rt 




MIPS32 Release 2 

Purpose: 






To sign-extend the least significant byte of GPR rt and store the value into GPR rd. 

Description: rd <— SignExtend (rt 7- -0 ) 

The least significant byte from GPR rt is sign-extended and stored in GPR rd. 

Restrictions: 

In implementations prior to Release 2 of the architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

Operation: 

GPR [ rd] <r- sign_extend (GPR [ rt ] 7 . _ 0 ) 

Exceptions: 

Reserved Instruction 

Programming Notes: 

For symmetry with the SEB and SEH instructions, one would expect that there would be ZEB and ZEH instructions 
that zero-extend the source operand. Similarly, one would expect that the SEW and ZEW instructions would exist to 
sign- or zero-extend a word to a doubleword. These instructions do not exist because there are functionally-equiva- 
lent instructions already in the instruction set. The following table shows the instructions providing the equivalent 


functions. 

Expected Instruction 

Function 

Equivalent Instruction 


ZEB rx,ry 

Zero-Extend Byte 

ANDI rx, ry, OxFF 


ZEH rx,ry 

Zero-Extend Halfword 

AND I rx, ry, OxFFFF 
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Sign-Extend Halfword 
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Format: seh rd, rt 




MIPS32 Release 2 

Purpose: 






To sign-extend the least significant halfword of GPR rt and store the value into GPR rd. 

Description: rd <— SignExtend (rt 15- _ 0 ) 

The least significant halfword from GPR rt is sign-extended and stored in GPR rd. 

Restrictions: 

In implementations prior to Release 2 of the architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

Operation: 

GPR [ rd] <r- sign_extend (GPR [ rt ] 15 . _ 0 ) 

Exceptions: 

Reserved Instruction 

Programming Notes: 

The SEH instruction can be used to convert two contiguous halfwords to sign-extended word values in three instruc- 
tions. For example: 


lw 

to, 

0 (al) 

/* 

Read 

two contiguous 

halfwords */ 




seh 

tl, 

to 

/* 

tl = 

lower halfword 

sign-ext ended 

to 

word 

*/ 

sra 

to, 

to, 16 

/* 

to = 

upper halfword 

sign-ext ended 

to 

word 

*/ 


Zero-extended halfwords can be created by changing the SEH and SRA instructions to ANDI and SRL instructions, 
respectively. 
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Sign-Extend Halfword, cont. 


SEH 


For symmetry with the SEB and SEH instructions, one would expect that there would be ZEB and ZEH instructions 
that zero-extend the source operand. Similarly, one would expect that the SEW and ZEW instructions would exist to 
sign- or zero-extend a word to a doubleword. These instructions do not exist because there are functionally-equiva- 
lent instructions already in the instruction set. The following table shows the instructions providing the equivalent 
functions. 


Expected Instruction Function Equivalent Instruction 

ZEB rx,ry Zero-Extend Byte AND I rx, ry, OxFF 

ZEH rx,ry Zero-Extend Halfword AND I rx, ry, OxFFFF 
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Store Halfword 


SH 
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Format: SH rt, offset (base) MIPS32 

Purpose: 

To store a halfword to memory 

Description: memory [basetoff set ] 4— rt 

The least-significant 16-bit halfword of register rt is stored in memory at the location specified by the aligned effec- 
tive address. The 16-bit signed offset is added to the contents of GPR base to form the effective address. 

Restrictions: 

The effective address must be naturally-aligned. If the least-significant bit of the address is non-zero, an Address 
Error exception occurs. 

Operation: 

vAddr 4— sign_extend (off set) + GPR [base] 
if vAddr 0 ^ 0 then 

SignalException (AddressError) 
endif 

(pAddr, CCA) 4— AddressTranslation (vAddr, DATA, STORE) 
pAddr 4— pAddr PSIZE _ 1 2 I I (pAddrl lii0 xor (ReverseEndian | 0)) 
bytesel4— vAddrl x 0 xor (BigEndianCPU I | 0) 

dataworde- GPR[rt] 31 _ 8 * bytesel . .o || o 8 * bytesel 

StoreMemory (CCA, HALFWORD, dataword, pAddr, vAddr, DATA) 

Exceptions: 

TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch 
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Shift Word Left Logical 
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Format: SLL rd, rt, sa MIPS32 

Purpose: 

To left-shift a word by a fixed number of bits 

Description: rd 4— rt « sa 

The contents of the low-order 32-bit word of GPR rt are shifted left, inserting zeros into the emptied bits; the word 
result is placed in GPR rd. The bit-shift amount is specified by sa. 

Restrictions: 

None 

Operation: 

s 4— sa 

temp 4- GPRfrt;] (31 _ s) . .o I! 0 s 

GPR[rd]4— temp 

Exceptions: 

None 

Programming Notes: 

SLL rO, rO, 0, expressed as NOP, is the assembly idiom used to denote no operation. 

SLL rO, rO, 1, expressed as SSNOP, is the assembly idiom used to denote no operation that causes an issue break on 
superscalar processors. 
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SLLV 


Shift Word Left Logical Variable 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL 




0 

SLLV 


rs 

rt 

rd 



000000 




00000 

000100 


6 5 

5 

5 

5 

6 

Format: SLLV rd, rt, rs 




MIPS32 


Purpose: To left-shift a word by a variable number of bits 
Description: rd rt « rs 

The contents of the low-order 32-bit word of GPR rt are shifted left, inserting zeros into the emptied bits; the result 
word is placed in GPR rd. The bit-shift amount is specified by the low-order 5 bits of GPR rs. 

Restrictions: None 

Operation: 

s <— GPR [rs] 4 _o 

temp <- GPR [ rt ] (31 - s) . . o II ° S 

GPR[rd]<— temp 

Exceptions: None 

Programming Notes: 

None 
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Set on Less Than 


SLT 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL 




0 

SLT 


rs 

rt 

rd 



000000 




00000 

101010 


6 5 5 5 5 6 


Format: SLT rd, rs, rt MIPS32 

Purpose: 

To record the result of a less-than comparison 
Description: rd <- (rs < rt) 

Compare the contents of GPR rs and GPR rt as signed integers and record the Boolean result of the comparison in 
GPR rd. If GPR rs is less than GPR rt, the result is 1 (true); otherwise, it is 0 (false). 

The arithmetic comparison does not cause an Integer Overflow exception. 

Restrictions: 

None 


Operation: 


if GPR [ rs ] 
GPR [ rd] 
else 

GPR [ rd] 
endif 


< GPR[rt] then 

qGPRLEN-1 | | ^ 

qGPRLEN 


Exceptions: 

None 
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Set on Less Than Immediate 
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Format: SLTI rt, rs, immediate MIPS32 

Purpose: 

To record the result of a less-than comparison with a constant 

Description: rt <— (rs < immediate) 

Compare the contents of GPR rs and the 16-bit signed immediate as signed integers and record the Boolean result of 
the comparison in GPR rt. If GPR rs is less than immediate , the result is 1 (true); otherwise, it is 0 (false). 

The arithmetic comparison does not cause an Integer Overflow exception. 

Restrictions: 

None 


Operation: 


if GPR [ rs ] 
GPR [rt] 
else 

GPR [ rt ] 
endif 


< sign_extend (immediate) 

qGPRLEN-1 | | 
qGPRLEN 


then 


Exceptions: 

None 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


257 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




Set on Less Than Immediate Unsigned 
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Format: SLTIU rt, rs, immediate MIPS32 

Purpose: 

To record the result of an unsigned less-than comparison with a constant 

Description: rt <— (rs < immediate) 

Compare the contents of GPR rs and the sign-extended 16-bit immediate as unsigned integers and record the Boolean 
result of the comparison in GPR rt. If GPR rs is less than immediate, the result is 1 (true); otherwise, it is 0 (false). 

Because the 16-bit immediate is sign-extended before comparison, the instruction can represent the smallest or largest 
unsigned numbers. The representable values are at the minimum [0, 32767] or maximum [max_unsigned-32767, 
max_unsigned] end of the unsigned range. 

The arithmetic comparison does not cause an Integer Overflow exception. 

Restrictions: 

None 


Operation: 

sign_extend (immediate) ) then 

1 

endif 

Exceptions: 

None 


if (0 1 | GPR [ rs ] ) < (0 II 
GPR [ rt ] o GPRLEN_1 | | 
sis© 

GPR f rt 1 0 GPRLEN 
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Set on Less Than Unsigned 
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Format: SLTU rd, rs, rt MIPS32 

Purpose: 

To record the result of an unsigned less-than comparison 
Description: rd <- (rs < rt) 

Compare the contents of GPR rs and GPR rt as unsigned integers and record the Boolean result of the comparison in 
GPR rd. If GPR rs is less than GPR rt, the result is 1 (true); otherwise, it is 0 (false). 

The arithmetic comparison does not cause an Integer Overflow exception. 

Restrictions: 

None 

Operation: 

if (0 || GPRfrs] ) < (0 | | GPR[rt] ) then 
GPR [ rd] <- o GPRLEN_1 | ] 1 
else 

GPR [ rd] <- 0 GPRLEN 
endif 

Exceptions: 

None 
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SQRT.fmt 
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Format: 

SQRT 

,S 

fd, 

f s 










MIPS32 


SQRT 

. D 

fd, 

f s 










MIPS32 


Purpose: 

To compute the square root of an FP value 
Description: fd <- SQRT(fs) 

The square root of the value in FPR fs is calculated to infinite precision, rounded according to the current rounding 
mode in FCSR , and placed into FPR fd. The operand and result are values in format fmt. 

If the value in FPR/? corresponds to - 0, the result is - 0. 

Restrictions: 

If the value in FPR fs is less than 0, an Invalid Operation condition is raised. 

The fields fs and fd must specify FPRs valid for operands of type fmt\ if they are not valid, the result is UNPRE- 
DICTABLE. 

The operand must be a value in format fmt; if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

Operation: 

StoreFPR(fd, fmt, SquareRoot (ValueFPR ( f s , fmt))) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Invalid Operation, Inexact, Unimplemented Operation 
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Shift Word Right Arithmetic 
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Format: SRA rd, rt, sa 




MIPS32 

Purpose: 






To execute an arithmetic right-shift of a word by a fixed number of bits 

Description: rd 4— rt >> sa (arithmetic) 

The contents of the low-order 32-bit word of GPR rt are shifted right, duplicating the sign-bit (bit 31) in the emptied 
bits; the word result is placed in GPR rd. The bit-shift amount is specified by sa. 

Restrictions: 

None 

Operation: 

s 4— sa 

temp 4— (GPR[rt] 31 ) s | I GPR[rt] 3 1--s 

GPR[rd]4— temp 

Exceptions: None 
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SRAV 


Shift Word Right Arithmetic Variable 
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Format: SRAV rd, rt, rs 




MIPS32 

Purpose: 






To execute an arithmetic right-shift of a word by a variable number of bits 

Description: rd 4— rt >> rs (arithmetic) 

The contents of the low-order 32-bit word of GPR rt are shifted right, duplicating the sign-bit (bit 31) in the emptied 
bits; the word result is placed in GPR rd. The bit-shift amount is specified by the low-order 5 bits of GPR rs. 

Restrictions: 

None 

Operation: 

s 4- GPRfrs] 4 _o 

temp 4— (GPR[rt] 31 ) s | I GPR[rt] 3 1--s 

GPR[rd]4— temp 

Exceptions: 

None 
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Shift Word Right Logical 
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Format: SRL rd, rt, sa 




MIPS32 

Purpose: 






To execute a logical right-shift of a word by a fixed number of bits 

Description: rd 4— rt >> sa (logical) 

The contents of the low-order 32-bit word of GPR rt are shifted right, inserting zeros into the emptied bits; the word 
result is placed in GPR rd. The bit-shift amount is specified by sa. 

Restrictions: 

None 

Operation: 

s 4— sa 

temp 4— 0 s | ] GPR[rt] 3 1--s 

GPR[rd]4— temp 

Exceptions: 

None 
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Shift Word Right Logical Variable 
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Format: SRLV rd, rt, rs 





MIPS32 

Purpose: 







To execute a logical right-shift of a word by a variable number of bits 

Description: rd 4— rt >> rs (logical) 

The contents of the low-order 32-bit word of GPR rt are shifted right, inserting zeros into the emptied bits; the word 
result is placed in GPR rd. The bit-shift amount is specified by the low-order 5 bits of GPR rs. 

Restrictions: 

None 

Operation: 

s 4- GPRfrs] 4 _o 

temp 4— 0 s | | GPR[rt] 3 1--s 

GPR[rd]4— temp 

Exceptions: 

None 
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Superscalar No Operation 
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Format: SSNOP MIPS32 

Purpose: 

Break superscalar issue on a superscalar processor. 

Description: 

SSNOP is the assembly idiom used to denote superscalar no operation. The actual instruction is interpreted by the 
hardware as SLL rO, rO, 1 . 

This instruction alters the instruction issue behavior on a superscalar processor by forcing the SSNOP instruction to 
single-issue. The processor must then end the current instruction issue between the instruction previous to the SSNOP 
and the SSNOP. The SSNOP then issues alone in the next issue slot. 

On a single-issue processor, this instruction is a NOP that takes an issue slot. 

Restrictions: 

None 

Operation: 

None 

Exceptions: 

None 

Programming Notes: 

SSNOP is intended for use primarily to allow the programmer control over CPO hazards by converting instructions 
into cycles in a superscalar processor. For example, to insert at least two cycles between an MTCO and an ERET, one 
would use the following sequence: 

mtcO x, y 
ssnop 
ssnop 
eret 

Based on the normal issues rules of the processor, the MTCO issues in cycle T. Because the SSNOP instructions must 
issue alone, they may issue no earlier than cycle T+l and cycle T+2, respectively. Finally, the ERET issues no earlier 
than cycle T+3. Note that although the instruction after an SSNOP may issue no earlier than the cycle after the 
SSNOP is issued, that instruction may issue later. This is because other implementation-dependent issue rules may 
apply that prevent an issue in the next cycle. Processors should not introduce any unnecessary delay in issuing 
SSNOP instructions. 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


265 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 





Subtract Word 
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Format: SUB rd, rs, rt 




MIPS32 

Purpose: 






To subtract 32-bit integers. If overflow occurs, then trap 

Description: rd <- rs - rt 

The 32-bit word value in GPR rt is subtracted from the 32-bit value in GPR rs to produce a 32-bit result. If the sub- 
traction results in 32-bit 2’s complement arithmetic overflow, then the destination register is not modified and an Inte- 
ger Overflow exception occurs. If it does not overflow, the 32-bit result is placed into GPR rd. 

Restrictions: 

None 

Operation: 

temp <- (GPR [rs] 31 | | GPR [rs] 31 . -0 ) - (GPR [ rt ] 31 | | GPR [ rt ] 31 .. 0 ) 
if temp 32 temp 31 then 

SignalException ( IntegerOverf low) 
else 

GPR[rd] temp 31--0 
endif 

Exceptions: 

Integer Overflow 

Programming Notes: 

SUBU performs the same arithmetic operation but does not trap on overflow. 
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Floating Point Subtract 
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Format: SUB.S fd, fs, ft 

SUB.D fd, fs, ft 
SUB. PS fd, fs, ft 


MIPS32 
MIPS32 
MIPS64 
MIPS32 Release 2 


Purpose: 

To subtract FP values 

Description: fd «- fs - ft 

The value in FPR ft is subtracted from the value in FPR fs. The result is calculated to infinite precision, rounded 
according to the current rounding mode in FCSR , and placed into FPR fd. The operands and result are values in for- 
mat fmt. SUB. PS subtracts the upper and lower halves of FPR fs and FPR ft independently, and ORs together any 
generated exceptional conditions. 

Restrictions: 

The fields fs.ft, and fd must specify FPRs valid for operands of type fmt. If they are not valid, the result is UNPRE- 
DICTABLE. 

The operands must be values in format fmt; if they are not, the result is UNPREDICTABLE and the value of the 
operand FPRs becomes UNPREDICTABLE. 

The result of SUB. PS is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR (fd, fmt, ValueFPR(fs, fmt) — fmt ValueFPR(ft, fmt)) 

CPU Exceptions: 

Coprocessor Unusable, Reserved Instruction 

FPU Exceptions: 

Inexact, Overflow, Underflow, Invalid Op, Unimplemented Op 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


267 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




SUBU 
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Format: SUBU rd, rs, rt 

Purpose: 

To subtract 32-bit integers 

Description: rd f- rs - rt 




MIPS32 


The 32-bit word value in GPR rt is subtracted from the 32-bit value in GPR rs and the 32-bit arithmetic result is and 
placed into GPR rd. 

No integer overflow exception occurs under any circumstances. 

Restrictions: 

None 

Operation: 

temp <— GPR[rs] - GPRfrt] 

GPR[rd]<— temp 

Exceptions: 

None 

Programming Notes: 

The term “unsigned” in the instruction name is a misnomer; this operation is 32-bit modulo arithmetic that does not 
trap on overflow. It is appropriate for unsigned arithmetic, such as address arithmetic, or integer arithmetic environ- 
ments that ignore overflow, such as C language arithmetic. 
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Store Doubleword Indexed Unaligned from Floating Point 


SUXC1 


31 26 25 21 20 16 15 11 10 6 5 0 


COP IX 




0 

SUXC1 


base 

index 

fs 



010011 




00000 

001101 


6 5 

5 

5 

5 

6 

Format: SUXC1 fs, index (base) 




MIPS64 





MIPS32 Release 2 

Purpose: 






To store a doubleword from an FPR to memory (GPR+GPR addressing) ignoring alignment 

Description: memory [ (base+index) PSIZE _ 1 3 ] 4— fs 

The contents of the 64-bit doubleword in FPR fs is stored at the memory location specified by the effective address. 
The contents of GPR index and GPR base are added to form the effective address. The effective address is double- 
word-aligned; EffectiveAddresso 0 are ignored. 

Restrictions: 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 


Operation: 

vAddr 4— (GPR [base ] +GPR [ index] ) 63 __ 3 | | 0 3 

(pAddr, CCA) 4— AddressTranslation (vAddr, DATA, STORE) 
lsw 4- ValueFPR ( ft , UNINTERP RETED_WORD ) 
msw 4- ValueFPR (ft+1, UNINTERP RETED_WORD ) 

paddr 4— paddr xor ( (BigEndianCPU xor ReverseEndian) || 0 2 ) 
paddr 4— paddr xor 2#100 


Exceptions: 

Coprocessor Unusable, Reserved Instruction, TLB Refill, TLB Invalid, TLB Modified, Watch 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


269 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




Store Word 


SW 


31 26 25 21 20 16 15 


0 


SW 

101011 


base 


rt 


offset 


6 


5 


5 


16 


Format: SW rt, offset (base) MIPS32 

Purpose: 

To store a word to memory 

Description: memory [basetof f set ] rt 

The least-significant 32-bit word of register rt is stored in memory at the location specified by the aligned effective 
address. The 16-bit signed offset is added to the contents of GPR base to form the effective address. 

Restrictions: 

The effective address must be naturally-aligned. If either of the 2 least-significant bits of the address is non-zero, an 
Address Error exception occurs. 

Operation: 

vAddr <r- sign_extend (off set) + GPR [base] 
if vAddr 1# 0 7^ 0 2 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) 4— AddressTranslation (vAddr, DATA, STORE) 
dataworde- GPR[rt] 

StoreMemory (CCA, WORD, dataword, pAddr, vAddr, DATA) 

Exceptions: 

TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch 
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Store Word from Floating Point 


SWC1 


31 


26 25 

21 20 

16 

15 

0 


SWC1 

111001 

base 


ft 

offset 


6 

5 


5 

16 



Format: SWC1 ft, offset (base) MIPS32 

Purpose: 

To store a word from an FPR to memory 

Description: memory [base+off set ] <— ft 

The low 32-bit word from FPR/f is stored in memory at the location specified by the aligned effective address. The 
16-bit signed offset is added to the contents of GPR base to form the effective address. 

Restrictions: 

An Address Error exception occurs if Effective Address] () A 0 (not word-aligned). 

Operation: 

vAddr <r- sign_extend (off set) + GPR [base] 
if vAddr-L ^ 0 3 then 

SignalException (AddressError) 
endif 

(pAddr, CCA) <— AddressTranslation (vAddr, DATA, STORE) 
dataword <- ValueFPR(ft, UNINTERPRETED_WORD) 

StoreMemory (CCA, WORD, dataword, pAddr, vAddr, DATA) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction, TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch 
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SWC2 


Store Word from Coprocessor 2 


31 26 25 21 20 16 15 


0 


SWC2 

111010 


base 


rt 


offset 


6 


5 


5 


16 


Format: SWC2 rt, offset (base) MIPS32 

Purpose: 

To store a word from a COP2 register to memory 

Description: memory [base+off set ] 4— rt 

The low 32-bit word from COP2 (Coprocessor 2) register rt is stored in memory at the location specified by the 
aligned effective address. The 16-bit signed offset is added to the contents of GPR base to form the effective address. 

Restrictions: 

An Address Error exception occurs if EffectiveAddress | () A () (not word-aligned). 


Operation: 

vAddr 4— sign_extend (off set) + GPR[base] 
if vAddr 2 o ^ 0 3 then 

SignalException (AddressError) 
endif 

(pAddr, CCA) 4 — AddressTranslation (vAddr, DATA, STORE) 
dataword 4— CPR[2,rt,0] 

StoreMemory (CCA, WORD, dataword, pAddr, vAddr, DATA) 

Exceptions: 

Coprocessor Unusable, Reserved Instruction, TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch 
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Store Word Left 


SWL 



Format: SWL rt, offset (base) MIPS32 

Purpose: 

To store the most-significant part of a word to an unaligned memory address 

Description: memory [base+off set ] 4— rt 

The 16-bit signed offset is added to the contents of GPR base to form an effective address (EffAddr). EjfAddr is the 
address of the most-significant of 4 consecutive bytes forming a word (W) in memory starting at an arbitrary byte 
boundary. 

A part of W, the most-significant 1 to 4 bytes, is in the aligned word containing EjfAddr. The same number of the 
most-significant (left) bytes from the word in GPR rt are stored into these bytes of W. 

The following figure illustrates this operation using big-endian byte ordering for 32-bit and 64-bit registers. The 4 
consecutive bytes in 2. .5 form an unaligned word starting at location 2. A part of W, 2 bytes, is located in the aligned 
word containing the most-significant byte at 2. First, SWL stores the most-significant 2 bytes of the low word from 
the source register into these 2 bytes in memory. Next, the complementary SWR stores the remainder of the unaligned 
word. 

Figure 3-9 Unaligned Word Store Using SWL and SWR 



The bytes stored from the source register to memory depend on both the offset of the effective address within an 
aligned word — that is, the low 2 bits of the address ( vAddrl.,0 ) — and the current byte-ordering mode of the processor 
(big- or little-endian). The following figure shows the bytes stored for every combination of offset and byte ordering. 
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Store Word Left (cont.) SWL 

Figure 3-10 Bytes Stored by an SWL Instruction 
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Restrictions: 

None 

Operation: 

vAddr sign_extend (off set) + GPR[base] 

(pAddr, CCA ) <— AddressTranslation (vAddr, DATA, STORE) 

o 

pAddr <— pAddr PSIZE _ 1 2 I I (pAddr p 0 xor ReverseEndian ) 
If BigEndianMem = 0 then 

pAddr <— pAddr PSIZE _ 1 _ 2 I I 0 2 
endif 

byte <— vAddr x..O xor BigEndianCPU 2 
dataworde- o 24_8 * b ' yte II GPR [ rt ] 31 _ _ 2 4 - 8 *byte 
StoreMemory (CCA, byte, dataword, pAddr, vAddr, DATA) 

Exceptions: 

TLB Refill, TLB Invalid, TLB Modified, Bus Error, Address Error , Watch 
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Store Word Right 


SWR 



Format: SWR rt, offset (base) MIPS32 

Purpose: 

To store the least-significant part of a word to an unaligned memory address 

Description: memory [base+off set ] 4— rt 

The 16-bit signed offset is added to the contents of GPR base to form an effective address (EffAddr). EffAddr is the 
address of the least-significant of 4 consecutive bytes forming a word (W) in memory starting at an arbitrary byte 
boundary. 

A part of W, the least-significant 1 to 4 bytes, is in the aligned word containing EffAddr. The same number of the 
least-significant (right) bytes from the word in GPR rt are stored into these bytes of W. 

The following figure illustrates this operation using big-endian byte ordering for 32-bit and 64-bit registers. The 4 
consecutive bytes in 2. .5 form an unaligned word starting at location 2. A part of W, 2 bytes, is contained in the 
aligned word containing the least-significant byte at 5. First, SWR stores the least-significant 2 bytes of the low word 
from the source register into these 2 bytes in memory. Next, the complementary SWL stores the remainder of the 
unaligned word. 

Figure 3-11 Unaligned Word Store Using SWR and SWU 
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Store Word Right (cont.) 


SWR 


The bytes stored from the source register to memory depend on both the offset of the effective address within an 
aligned word — that is, the low 2 bits of the address ( yAddrl.,0 ) — and the current byte-ordering mode of the processor 
(big- or little-endian). The following figure shows the bytes stored for every combination of offset and byte-ordering. 

Figure 3-12 Bytes Stored by SWR Instruction 
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Restrictions: 

None 

Operation: 

vAddr 4— sign_extend (off set) + GPR[base] 

(pAddr, CCA) 4— AddressTranslation (vAddr, DATA, STORE) 

2 

pAddr 4— pAddr PSIZE _ 1 _ I I (pAddr^ g xor ReverseEndian ) 
If BigEndianMem = 0 then 

pAddr 4— pAddr PSIZE _ 1 2 M 0 2 
endif 

byte 4— vAddr 1-iQ xor BigEndianCPU 2 
dataword4— GPR [ rt ] 31 _ 8 * byte I I 0 8 * byte 

StoreMemory (CCA, WORD-byte, dataword, pAddr, vAddr, DATA) 

Exceptions: 

TLB Refill, TLB Invalid, TLB Modified, Bus Error, Address Error, Watch 
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Store Word Indexed from Floating Point 


SWXC1 


31 26 25 21 20 16 15 11 10 6 5 0 


COP IX 




0 

SWXC1 


base 

index 

fs 



010011 




00000 

001000 


6 5 

5 

5 

5 

6 

Format: SWXC1 fs, index (base) 




MIPS64 





MIPS32 Release 2 

Purpose: 






To store a word from an FPR to memory (GPR+GPR addressing) 

Description: memory [base+index] <— fs 

The low 32-bit word from FPR /s is stored in memory at the location specified by the aligned effective address. The 
contents of GPR index and GPR base are added to form the effective address. 

Restrictions: 

An Address Error exception occurs if EficctiveAddrcss | () A () ( no t word-aligned). 


Operation: 

vAddr 4— GPR [base] + GPR [index] 
if vAddr-L q A 0 3 then 

SignalException (AddressError ) 
endif 

(pAddr, CCA) <— AddressTranslation (vAddr, DATA, STORE) 
dataword <- ValueFPR(ft, UNINTERPRETED_WORD) 

StoreMemory (CCA, WORD, dataword, pAddr, vAddr, DATA) 

Exceptions: 

TLB Refill, TLB Invalid, TLB Modified, Address Error, Reserved Instruction, Coprocessor Unusable, Watch 
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SYNC 


Synchronize Shared Memory 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL 
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stype 

SYNC 
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00 0000 0000 0000 0 


001111 
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6 

Format: sync 

(stype = 0 implied) 


MIPS32 


Purpose: 

To order loads and stores. 

Description: 

Simple Description: 

• SYNC affects only uncached and cached coherent loads and stores. The loads and stores that occur before the SYNC 
must be completed before the loads and stores after the SYNC are allowed to start. 

• Loads are completed when the destination register is written. Stores are completed when the stored value is visible to 
every other processor in the system. 

• SYNC is required, potentially in conjunction with SSNOP, to guarantee that memory reference results are visible 
across operating mode changes. For example, a SYNC is required on some implementations on entry to and exit 
from Debug Mode to guarantee that memory affects are handled correctly. 

Detailed Description: 

• When the stype field has a value of zero, every synchronizable load and store that occurs in the instruction stream 
before the SYNC instruction must be globally performed before any synchronizable load or store that occurs after the 
SYNC can be performed, with respect to any other processor or coherent I/O module. 

• SYNC does not guarantee the order in which instruction fetches are performed. The stype values 1-31 are reserved 
for future extensions to the architecture. A value of zero will always be defined such that it performs all defined 
synchronization operations. Non-zero values may be defined to remove some synchronization operations. As such, 
software should never use a non-zero value of the stype field, as this may inadvertently cause future failures if 
non-zero values remove synchronization operations. 
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SYNC 


Synchronize Shared Memory (cont.) 


Terms: 

Synchronizable : A load or store instmction is synchronizable if the load or store occurs to a physical location in 
shared memory using a virtual location with a memory access type of either uncached or cached coherent. Shared 
memory is memory that can be accessed by more than one processor or by a coherent I/O system module. 

Performed load: A load instruction is performed when the value returned by the load has been determined. The result 
of a load on processor A has been determined with respect to processor or coherent I/O module B when a subsequent 
store to the location by B cannot affect the value returned by the load. The store by B must use the same memory 
access type as the load. 

Performed store: A store instruction is performed when the store is observable. A store on processor A is observable 
with respect to processor or coherent I/O module B when a subsequent load of the location by B returns the value 
written by the store. The load by B must use the same memory access type as the store. 

Globally performed load: A load instruction is globally performed when it is performed with respect to all processors 
and coherent I/O modules capable of storing to the location. 

Globally performed store: A store instruction is globally performed when it is globally observable. It is globally 
observable when it is observable by all processors and I/O modules capable of loading from the location. 

Coherent I/O module: A coherent I/O module is an Input/Output system component that performs coherent Direct 
Memory Access (DMA). It reads and writes memory independently as though it were a processor doing loads and 
stores to locations with a memory access type of cached coherent. 
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Synchronize Shared Memory (cont.) 


SYNC 


Restrictions: 

The effect of SYNC on the global order of loads and stores for memory access types other than uncached and cached 

coherent is UNPREDICTABLE. 

Operation: 

SyncOperation (stype) 

Exceptions: 

None 

Programming Notes: 

A processor executing load and store instructions observes the order in which loads and stores using the same mem- 
ory access type occur in the instruction stream; this is known as program order. 

A parallel program has multiple instruction streams that can execute simultaneously on different processors. In mul- 
tiprocessor (MP) systems, the order in which the effects of loads and stores are observed by other processors — the 
global order of the loads and store — determines the actions necessary to reliably share data in parallel programs. 

When all processors observe the effects of loads and stores in program order, the system is strongly ordered. On such 
systems, parallel programs can reliably share data without explicit actions in the programs. For such a system, SYNC 
has the same effect as a NOP. Executing SYNC on such a system is not necessary, but neither is it an error. 

If a multiprocessor system is not strongly ordered, the effects of load and store instructions executed by one processor 
may be observed out of program order by other processors. On such systems, parallel programs must take explicit 
actions to reliably share data. At critical points in the program, the effects of loads and stores from an instruction 
stream must occur in the same order for all processors. SYNC separates the loads and stores executed on the proces- 
sor into two groups, and the effect of all loads and stores in one group is seen by all processors before the effect of 
any load or store in the subsequent group. In effect, SYNC causes the system to be strongly ordered for the executing 
processor at the instant that the SYNC is executed. 

Many MIPS-based multiprocessor systems are strongly ordered or have a mode in which they operate as strongly 
ordered for at least one memory access type. The MIPS architecture also permits implementation of MP systems that 
are not strongly ordered; SYNC enables the reliable use of shared memory on such systems. A parallel program that 
does not use SYNC generally does not operate on a system that is not strongly ordered. However, a program that does 
use SYNC works on both types of systems. (System-specific documentation describes the actions needed to reliably 
share data in parallel programs for that system.) 

The behavior of a load or store using one memory access type is UNPREDICTABLE if a load or store was previ- 
ously made to the same physical location using a different memory access type. The presence of a SYNC between the 
references does not alter this behavior. 
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SYNC 


Synchronize Shared Memory (cont.) 

SYNC affects the order in which the effects of load and store instructions appear to all processors; it does not gener- 
ally affect the physical memory-system ordering or synchronization issues that arise in system programming. The 
effect of SYNC on implementation-specific aspects of the cached memory system, such as writeback buffers, is not 
defined. The effect of SYNC on reads or writes to memory caused by privileged implementation-specific instruc- 
tions, such as CACHE, also is not defined. 

# Processor A (writer) 

# Conditions at entry: 

# The value 0 has been stored in FLAG and that value is observable by B 


sw 

Rl, 

DATA 

# 

change shared DATA value 

LI 

R2 , 

1 



SYNC 


# 

Perforin DATA store before performing FLAG store 

SW 

R2 , 

FLAG 

# 

say that the shared DATA value is valid 


# Processor B 

(reader) 



LI 

R2 , 

1 



1 : LW 

Rl, 

FLAG # 

Get FLAG 


BNE 

R2 , 

Rl, IB# 

if it says that DATA is not valid, poll again 


NOP 





SYNC 

# 

FLAG value checked before doing DATA read 


LW 

Rl, 

DATA # 

Read (valid) shared DATA value 


Prefetch operations have no effect detectable by User-mode programs, so ordering the effects of prefetch operations 
is not meaningful. 

The code fragments above shows how SYNC can be used to coordinate the use of shared data between separate writer 
and reader instruction streams in a multiprocessor environment. The FLAG location is used by the instruction streams 
to determine whether the shared data item DATA is valid. The SYNC executed by processor A forces the store of 
DATA to be performed globally before the store to FLAG is performed. The SYNC executed by processor B ensures 
that DATA is not read until after the FLAG value indicates that the shared data is valid. 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


281 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




SYNCI 


Synchronize Caches to Make Instruction Writes Effective 


31 26 25 21 20 16 15 0 


REGIMM 


SYNCI 



base 


offset 

000001 


11111 



6 5 

5 

16 


Format: SYNCI offset (base) 



MIPS32 Release 2 

Purpose: 





To synchronize all caches to make instruction writes effective. 

Description: 

This instruction is used after a new instruction stream is written to make the new instructions effective relative to an 
instruction fetch, when used in conjunction with the SYNC and JALR.HB, JR.HB, or ERET instructions, as 
described below. Unlike the CACHE instruction, the SYNCI instruction is available in all operating modes in an 
implementation of Release 2 of the architecture. 

The 16-bit offset is sign-extended and added to the contents of the base register to form an effective address. The 
effective address is used to address the cache line in all caches which may need to be synchronized with the write of 
the new instructions. The operation occurs only on the cache line which may contain the effective address. One 
SYNCI instruction is required for every cache line that was written. See the Programming Notes below. 

A TLB Refill and TLB Invalid (both with cause code equal TLBL) exception can occur as a byproduct of this instruc- 
tion. This instruction never causes TLB Modified exceptions nor TLB Refill exceptions with a cause code of TLBS. 

A Cache Error exception may occur as a byproduct of this instruction. Lor example, if a writeback operation detects a 
cache or bus error during the processing of the operation, that error is reported via a Cache Error exception. Similarly, 
a Bus Error Exception may occur if a bus operation invoked by this instruction is terminated in an error. 

An Address Error Exception (with cause code equal AdEL) may occur if the effective address references a portion of 
the kernel address space which would normally result in such an exception. It is implementation dependent whether 
such an exception does occur. 

It is implementation dependent whether a data watch is triggered by a SYNCI instruction whose address matches the 
Watch register address match conditions. 

Restrictions: 

The operation of the processor is UNPREDICTABLE if the effective address references any instruction cache line 
that contains instructions to be executed between the SYNCI and the subsequent JALR.HB, JR.HB, or ERET instruc- 
tion required to clear the instruction hazard. 

The SYNCI instruction has no effect on cache lines that were prevsiously locked with the CACHE instruction. If cor- 
rect software operation depends on the state of a locked line, the CACHE instruction must be used to synchronize the 
caches. 

The SYNCI instruction acts only on the current processor. It doesn’t not affect the caches on other processors in a 
multi-processor system, except as required to perform the operation on the current processor (as might be the case if 
multiple processors share an L2 or L3 cache). 
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SYNCI 


Synchronize Caches to Make Instruction Writes Effective, cont. 


Operation: 

vaddr GPRfbase] + sign_extend (off set) 

SynchronizeCacheLines (vaddr) /* Operate on all caches */ 

Exceptions: 

Reserved Instruction Exception (Release 1 implementations only) 

TLB Refill Exception 
TLB Invalid Exception 
Address Error Exception 
Cache Enor Exception 
Bus Error Exception 
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Synchronize Caches to Make Instruction Writes Effective, cont. 


SYNCI 


Programming Notes: 

When the instruction stream is written, the SYNCI instruction should be used in conjunction with other instructions 
to make the newly-written instructions effective. The following example shows a routine which can be called after 
the new instruction stream is written to make those changes effective. Note that the SYNCI instruction could be 
replaced with the corresponding sequence of CACHE instructions (when access to Coprocessor 0 is available), and 
that the JR.HB instruction could be replaced with JALR.HB, ERET, or DERET instructions, as appropriate. 

/* 

* This routine makes changes to the instruction stream effective to the 

* hardware. It should be called after the instruction stream is written. 

* On return, the new instructions are effective. 

•k 

* Inputs: 

* aO = Start address of new instruction stream 

* al = Size, in bytes, of new instruction stream 
*/ 


addu 

al. 

aO , al 

/* 

Calculate end address + 1 */ 

rdhwr 

vO, 

HW_SYNCI_Step 

/* 

Get step size for SYNCI from new */ 




/* 

Release 2 instruction */ 

beq 

vO, 

zero, 20f 

/* 

If no caches require synchronization, */ 

nop 



/* 

branch around */ 

synci 

0 (aO) 

/* 

Synchronize all caches around address */ 

situ 

vl, 

aO , al 

/* 

Compare current with end address */ 

bne 

vl, 

zero, 10b 

/* 

Branch if more to do */ 

addu 

aO , 

aO , vO 

/* 

Add step size in delay slot */ 

sync 



/* 

Clear memory hazards */ 

j r . hb 

ra 


/* 

Return, clearing instruction hazards */ 


nop 
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System Call 


SYSCALL 

31 26 25 


6 5 0 

SPECIAL 

code 

SYSCALL 

000000 


001100 

6 

20 

6 

Format: syscall 
Purpose: 

To cause a System Call exception 

Description: 


MIPS32 


A system call exception occurs, immediately and unconditionally transferring control to the exception handler. 

The code field is available for use as software parameters, but is retrieved by the exception handler only by loading 
the contents of the memory word containing the instruction. 

Restrictions: 

None 

Operation: 

SignalException (SystemCall) 

Exceptions: 

System Call 
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Trap if Equal 


TEQ 



Format: TEQ rs, rt MIPS32 

Purpose: 

To compare GPRs and do a conditional trap 

Description: if rs = rt then Trap 

Compare the contents of GPR rs and GPR rt as signed integers; if GPR rs is equal to GPR rt, then take a Trap excep- 
tion. 

The contents of the code field are ignored by hardware and may be used to encode information for system software. 
To retrieve the information, system software must load the instruction word from memory. 

Restrictions: 

None 

Operation: 

if GPR[rs] = GPR[rt] then 
SignalException (Trap) 
endif 

Exceptions: 

Trap 
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Trap if Equal Immediate 


TEQI 


31 26 25 21 20 16 15 0 


REGIMM 

000001 

rs 

TEQI 

01100 

immediate 

6 

5 

5 

16 

Format: teqi 

rs, immediate 


MIPS32 

Purpose: 





To compare a GPR to a constant and do a conditional trap 

Description: if rs = immediate then Trap 

Compare the contents of GPR rs and the 16-bit signed immediate as signed integers; if GPR rs is equal to immediate, 
then take a Trap exception. 

Restrictions: 

None 

Operation: 

if GPR[rs] = sign_extend (immediate) then 
SignalException (Trap) 

endif 

Exceptions: 

Trap 
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21 20 


16 15 


6 5 


Trap if Greater or Equal 


TGE 


31 26 25 


0 


SPECIAL 




TGE 


rs 

rt 

code 


000000 




110000 


655 10 6 


Format: TGE rs, rt MIPS32 

Purpose: 

To compare GPRs and do a conditional trap 

Description: if rs > rt then Trap 

Compare the contents of GPR rs and GPR rt as signed integers; if GPR rs is greater than or equal to GPR rt, then take 
a Trap exception. 

The contents of the code field are ignored by hardware and may be used to encode information for system software. 
To retrieve the information, system software must load the instruction word from memory. 

Restrictions: 

None 

Operation: 

if GPR[rs] > GPR[rt] then 
SignalException (Trap) 
endif 

Exceptions: 

Trap 
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Trap if Greater or Equal Immediate 


TGEI 


31 26 25 

21 20 

16 

15 

0 

REGIMM 
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TGEI 

immediate 
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01000 
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16 


Format: tgei rs, 

immediate 



MIPS32 

Purpose: 






To compare a GPR to a constant and do a conditional trap 

Description: if rs > immediate then Trap 

Compare the contents of GPR rs and the 16-bit signed immediate as signed integers; if GPR rs is greater than or equal 
to immediate , then take a Trap exception. 

Restrictions: 

None 

Operation: 

if GPR[rs] > sign_extend (immediate) then 
SignalException (Trap) 
endif 

Exceptions: 

Trap 
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TGEIU 


Trap if Greater or Equal Immediate Unsigned 


31 26 25 21 20 16 15 0 


REGIMM 


TGEIU 



rs 


immediate 

000001 


01001 



6 5 

5 

16 


Format: TGEIU rs, immediate 



MIPS32 

Purpose: 





To compare a GPR to a constant and do a conditional trap 

Description: if rs > immediate then Trap 

Compare the contents of GPR rs and the 16-bit sign-extended immediate as unsigned integers; if GPR rs is greater 
than or equal to immediate , then take a Trap exception. 

Because the 16-bit immediate is sign-extended before comparison, the instruction can represent the smallest or largest 
unsigned numbers. The representable values are at the minimum [0, 32767] or maximum [max_unsigned-32767, 
max_unsigned] end of the unsigned range. 

Restrictions: 

None 

Operation: 

if (0 j [ GPRfrs]) > (0 | | sign_extend (immediate) ) then 
SignalException (Trap) 
endif 

Exceptions: 

Trap 
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TGEU 


Trap if Greater or Equal Unsigned 


31 26 25 21 20 16 15 6 5 0 
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TGEU 


rs 

rt 

code 


000000 




110001 


655 10 6 


Format: tgeu rs, rt MIPS32 

Purpose: 

To compare GPRs and do a conditional trap 

Description: if rs > rt then Trap 

Compare the contents of GPR rs and GPR rt as unsigned integers; if GPR rs is greater than or equal to GPR rt, then 
take a Trap exception. 

The contents of the code field are ignored by hardware and may be used to encode information for system software. 
To retrieve the information, system software must load the instruction word from memory. 

Restrictions: 

None 

Operation: 

if (0 | | GPR [ rs ] ) > (0 j | GPR[rt]) then 
SignalException (Trap) 
endif 

Exceptions: 

Trap 
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Probe TLB for Matching Entry 


TLBP 
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1 

000 0000 0000 0000 0000 

001000 
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1 
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6 


Format: tlbp MIPS32 

Purpose: 

To find a matching entry in the TLB. 

Description: 

The Index register is loaded with the address of the TLB entry whose contents match the contents of the EntryHi reg- 
ister. If no TLB entry matches, the high-order bit of the Index register is set. In Release 1 of the Architecture, it is 
implementation dependent whether multiple TLB matches are detected on a TLBP. However, implementations are 
strongly encouraged to report multiple TLB matches only on a TLB write. In Release 2 of the Architecture, multiple 
TLB matches may only be reported on a TLB write. 

Restrictions: 

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled. 

Operation: 

Index <— 1 | ] UNPREDICTABLE 31 

for i in 0 . . . TLBEntries-1 

if ( ( TLB [ i ] VPN2 and not (TLB [ i ] Mask ) ) = 

(EntryHi VPN2 and not (TLB [ i ] Mask ) ) ) and 
( ( TLB [ i ] G = 1) or (TLB[i] ASID = EntryHi ASID ) ) then 
Index <— i 
endif 
endf or 

Exceptions: 

Coprocessor Unusable 
Machine Check 
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Read Indexed TLB Entry 


TLBR 
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Format: tlbr MIPS32 

Purpose: 

To read an entry from the TLB. 

Description: 

The EntryHi, EntryLoO, EntryLol, and PageMask registers are loaded with the contents of the TLB entry pointed to 
by the Index register. In Release 1 of the Architecture, it is implementation dependent whether multiple TLB matches 
are detected on a TLBR. However, implementations are strongly encouraged to report multiple TLB matches only on 
a TLB write. In Release 2 of the Architecture, multiple TLB matches may only be reported on a TLB write. Note that 
the value written to the EntryHi, EntryLoO, and EntryLol registers may be different from that originally written to the 
TLB via these registers in that: 

• The value returned in the VPN2 field of the EntryHi register may havethose bits set to zero corresponding to the 
one bits in the Mask field of the TLB entry (the least significant bit of VPN2 corresponds to the least significant 
bit of the Mask field). It is implementation dependent whether these bits are preserved or zeroed after a TLB 
entry is written and then read. 

• The value returned in the PFN field of the EntryLoO and EntryLol registers may havethose bits set to zero 
corresponding to the one bits in the Mask field of the TLB entry (the least significant bit of PFN corresponds to 
the least significant bit of the Mask field). It is implementation dependent whether these bits are preserved or 
zeroed after a TLB entry is written and then read. 

• The value returned in the G bit in both the EntryLoO and EntryLol registers comes from the single G bit in the 
TLB entry. Recall that this bit was set from the logical AND of the two G bits in EntryLoO and EntryLol when 
the TLB was written. 

Restrictions: 

The operation is UNDEFINED if the contents of the Index register are greater than or equal to the number of TLB 
entries in the processor. 

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled. 
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Read Indexed TLB Entry 


TLBR 


Operation: 

i <— Index 

if i > (TLBEntries - 1) then 

UNDEFINED 

endif 

PageMask Mask <- TLB [ i ] Mask 
EntryHi <— 

(TLB[i] VPN2 and not TLB [ i ] Mask ) 1 I # Masking implementation dependent 
0 5 II TLB [ i ] ASID 
EntryLol < — 0 2 |[ 

(TLB[i] PFN1 and not TLB [ i ] Mask ) 1 I # Masking implementation dependent 
TLB [ i ] C1 I! TLB [ i ] D1 || TLB[i] vl I I TLB [ i ] G 
EntryLoO < — 0 2 | I 

( TLB [ i ] pfn q and not TLB [ i ] Mask ) 1 I # Masking implementation dependent 
TLB [ i ] c0 II TLB [ i ] D0 II TLB[i] v0 || TLB [ i ] G 


Exceptions: 

Coprocessor Unusable 
Machine Check 
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Write Indexed TLB Entry 


TLBWI 
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Format: tlbwi MIPS32 

Purpose: 

To write a TLB entry indexed by the Index register. 

Description: 

The TLB entry pointed to by the Index register is written from the contents of the EntryHi, EntryLoO, EntryLol, and 
PageMask registers. It is implementation dependent whether multiple TLB matches are detected on a TLBWI. In 
such an instance, a Machine Check Exception is signaled. In Release 2 of the Architecture, multiple TLB matches 
may only be reported on a TLB write. The information written to the TLB entry may be different from that in the 
EntryHi, EntryLoO, and EntryLol registers, in that: 

• The value written to the VPN2 field of the TLB entry may have those bits set to zero corresponding to the one 
bits in the Mask field of the PageMask register (the least significant bit of VPN2 corresponds to the least 
significant bit of the Mask field). It is implementation dependent whether these bits are preserved or zeroed 
during a TLB write. 

• The value written to the PFNO and PFN 1 fields of the TLB entry may have those bits set to zero corresponding to 
the one bits in the Mask field of PageMask register (the least significant bit of PFN corresponds to the least 
significant bit of the Mask field). It is implementation dependent whether these bits are preserved or zeroed 
during a TLB write. 

• The single G bit in the TLB entry is set from the logical AND of the G bits in the EntryLoO and EntryLol 
registers. 

Restrictions: 

The operation is UNDEFINED if the contents of the Index register are greater than or equal to the number of TLB 
entries in the processor. 

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled. 
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Write Indexed TLB Entry 


TLBWI 


Operation: 

i <— Index 

TLB [i] Mask PageMask Mask 

TLB[i] VPN2 <— EntryHi VPN2 and not PageMask Mask # Implementation dependent 

TLB[i] ASID EntryHi ASID 

TLB[i]g <— EntryLolg and EntryLoOg 


TLB [i 

1 PFN1 4 

f— EntryLol PFN 

and not 

PageMask Mask 

# 

Implementation 

dependent 

TLB [i 

1 Cl <" 

EntryLolg 






TLB [i 

Id l 

EntryLol D 






TLB [i 

1 VI 

EntryLol v 






TLB [i 

1 PFN0 ” 

EntryLo0 PFN 

and not 

PageMask Mask 

# 

Implementation 

dependent 

TLB [i 

1 CO 

EntryLo0 c 






TLB [i 

1 DO 

EntryLo0 D 






TLB [i 

Ivo 

EntryLo0 v 







Exceptions: 

Coprocessor Unusable 
Machine Check 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




Write Random TLB Entry 


TLBWR 
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Format: tlbwr MIPS32 

Purpose: 

To write a TLB entry indexed by the Random register. 

Description: 

The TLB entry pointed to by the Random register is written from the contents of the EntryHi, Entry Lo(), EntryLol, 
and PageMask registers. It is implementation dependent whether multiple TLB matches are detected on a TLBWR. 
In such an instance, a Machine Check Exception is signaled. In Release 2 of the Architecture, multiple TLB matches 
may only be reported on a TLB write. The information written to the TLB entry may be different from that in the 
EntryHi, EntryLoO, and EntryLol registers, in that: 

• The value written to the VPN2 field of the TLB entry may have those bits set to zero corresponding to the one 
bits in the Mask field of the PageMask register (the least significant bit of VPN2 corresponds to the least 
significant bit of the Mask field). It is implementation dependent whether these bits are preserved or zeroed 
during a TLB write. 

• The value written to the PFNO and PFN 1 fields of the TLB entry may have those bits set to zero corresponding to 
the one bits in the Mask field of PageMask register (the least significant bit of PFN corresponds to the least 
significant bit of the Mask field). It is implementation dependent whether these bits are preserved or zeroed 
during a TLB write. 

• The single G bit in the TLB entry is set from the logical AND of the G bits in the EntryLoO and EntryLol 
registers. 

Restrictions: 

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled. 
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Write Random TLB Entry 


TLBWR 


Operation: 

i Random 

TLB [i] Mask PageMask Mask 

TLB[i] VPN2 <— EntryHi VPN2 and not PageMask Mask # Implementation dependent 

TLB [ i ] as ID T^ntnyHi^siD 

TLB[i]g <— EntryLolg and EntryLoOg 


TLB [i 

1 PFN1 4 

t— EntryLol PFN 

and not 

PageMask Mask 

# 

Implementation 

dependent 

TLB [i 

1 Cl 

EntryLolg 






TLB [i 

Idi 

EntryLol D 






TLB [i 

1 VI 

EntryLol v 






TLB [i 

1 PFN0 ” 

f— EntryLo0 PFN 

and not 

PageMask Mask 

# 

Implementation 

dependent 

TLB [i 

1 CO 

EntryLoOg 






TLB [i 

1 DO <— 

EntryLo0 D 






TLB [i 

Ivo 

EntryLo0 v 







Exceptions: 

Coprocessor Unusable 
Machine Check 
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Trap if Less Than 


TLT 


31 26 25 21 20 16 15 6 5 0 


SPECIAL 




TLT 


rs 

rt 

code 


000000 




110010 


655 10 6 


Format: tlt rs, rt MIPS32 

Purpose: 

To compare GPRs and do a conditional trap 

Description: if rs < rt then Trap 

Compare the contents of GPR rs and GPR rt as signed integers; if GPR rs is less than GPR rt, then take a Trap excep- 
tion. 

The contents of the code field are ignored by hardware and may be used to encode information for system software. 
To retrieve the information, system software must load the instruction word from memory. 

Restrictions: 

None 

Operation: 

if GPR[rs] < GPR[rt] then 
SignalException (Trap) 
endif 

Exceptions: 

Trap 
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Trap if Less Than Immediate 


TLTI 
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16 


Format: tlti rs, 

immediate 



MIPS32 

Purpose: 






To compare a GPR to a constant and do a conditional trap 

Description: if rs < immediate then Trap 

Compare the contents of GPR rs and the 16-bit signed immediate as signed integers; if GPR rs is less than immediate , 
then take a Trap exception. 

Restrictions: 

None 

Operation: 

if GPR[rs] < sign_extend (immediate) then 
SignalException (Trap) 
endif 

Exceptions: 

Trap 
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TLTIU 


Trap if Less Than Immediate Unsigned 
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Format: TLTIU rs, immediate 



MIPS32 

Purpose: 





To compare a GPR to a constant and do a conditional trap 

Description: if rs < immediate then Trap 

Compare the contents of GPR rs and the 16-bit sign-extended immediate as unsigned integers; if GPR rs is less than 
immediate, then take a Trap exception. 

Because the 16-bit immediate is sign-extended before comparison, the instruction can represent the smallest or largest 
unsigned numbers. The representable values are at the minimum [0, 32767] or maximum [max_unsigned-32767, 
max_unsigned] end of the unsigned range. 

Restrictions: 

None 

Operation: 

if (0 |[ GPR[rs]) < (0 I I sign_extend (immediate) ) then 
SignalException (Trap) 
endif 

Exceptions: 

Trap 
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TLTU 


Trap if Less Than Unsigned 
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TLTU 
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rt 
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110011 
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Format: tltu rs, rt MIPS32 

Purpose: 

To compare GPRs and do a conditional trap 

Description: if rs < rt then Trap 

Compare the contents of GPR rs and GPR rt as unsigned integers; if GPR rs is less than GPR rt, then take a Trap 
exception. 

The contents of the code field are ignored by hardware and may be used to encode information for system software. 
To retrieve the information, system software must load the instruction word from memory. 

Restrictions: 

None 

Operation: 

if (0 || GPR[rs] ) < (0 [| GPR[rt] ) then 
SignalException (Trap) 
endif 

Exceptions: 

Trap 
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Trap if Not Equal 
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Format: tne rs, rt MIPS32 

Purpose: 

To compare GPRs and do a conditional trap 

Description: if rs ^ rt then Trap 

Compare the contents of GPR rs and GPR rt as signed integers; if GPR rs is not equal to GPR rt, then take a Trap 
exception. 

The contents of the code field are ignored by hardware and may be used to encode information for system software. 
To retrieve the information, system software must load the instruction word from memory. 

Restrictions: 

None 

Operation: 

if GPR[rs] ^ GPR[rt] then 

SignalException (Trap) 

endif 

Exceptions: 

Trap 
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Trap if Not Equal Immediate 
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Format: tnei rs, 

immediate 



MIPS32 

Purpose: 






To compare a GPR to a constant and do a conditional trap 

Description: if rs + immediate then Trap 

Compare the contents of GPR rs and the 16-bit signed immediate as signed integers; if GPR rs is not equal to imme- 
diate, then take a Trap exception. 

Restrictions: 

None 

Operation: 

if GPR[rs] ^ sign_extend (immediate) then 
SignalException (Trap) 
endif 

Exceptions: 

Trap 
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Floating Point Truncate to Long Fixed Point 


TRUNC.L.fmt 
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Format: 

TRUNC.L. S fd, fs 
TRUNC.L. D fd, fs 




MIPS64 
MIPS32 Release 2 
MIPS64 
MIPS32 Release 2 


Purpose: 

To convert an FP value to 64-bit fixed point, rounding toward zero 

Description: fd convert_and_round (fs) 

The value in FPR/s, in format fmt, is converted to a value in 64-bit long fixed point format and rounded toward zero 
(rounding mode 1). The result is placed in FPR/d. 

When the source value is Infinity, NaN, or rounds to an integer outside the range -2 63 to 2 63 -l, the result cannot be 
represented correctly and an IEEE Invalid Operation condition exists. In this case the Invalid Operation flag is set in 
the FCSR. If the Invalid Operation Enable bit is set in the FCSR , no result is written to fd and an Invalid Operation 
exception is taken immediately. Otherwise, the default result, 2 63 -l, is written to fd. 

Restrictions: 

The fields/? and fd must specify valid FPRs;/,v for type fmt and fd for long fixed point; if they are not valid, the result 

is UNPREDICTABLE. 

The operand must be a value in format fmt, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

The result of this instruction is UNPREDICTABLE if the processor is executing in 16 FP registers mode. 

Operation: 

StoreFPR(fd, L, ConvertFmt (ValueFPR ( f s , fmt), fmt, L) ) 
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Floating Point Truncate to Long Fixed Point (cont.) TRUNC.L.fmt 

Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Unimplemented Operation, Invalid Operation, Overflow, Inexact 
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Floating Point Truncate to Word Fixed Point 


TRUNC.W.fmt 
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fd 
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Format: trunc.w.s fd, fs MIPS32 

trunc.w.d fd, fs MIPS32 


Purpose: 

To convert an FP value to 32-bit fixed point, rounding toward zero 

Description: fd <— convert_and_round (fs) 

The value in FPR fs, in format fmt, is converted to a value in 32-bit word fixed point format using rounding toward 
zero (rounding mode 1). The result is placed in FPR/d. 

o i o 1 

When the source value is Infinity, NaN, or rounds to an integer outside the range -2 to 2 -1, the result cannot be 
represented correctly and an IEEE Invalid Operation condition exists. In this case the Invalid Operation flag is set in 
the FCSR. If the Invalid Operation Enable bit is set in the FCSR , no result is written to fd and an Invalid Operation 

• • • T 1 

exception is taken immediately. Otherwise, the default result, 2 —1, is written to fd. 

Restrictions: 

The fields/s and fd must specify valid FPRs;/,v for type fmt and fd for word fixed point; if they are not valid, the result 

is UNPREDICTABLE. 

The operand must be a value in format fmt, if it is not, the result is UNPREDICTABLE and the value of the operand 
FPR becomes UNPREDICTABLE. 

Operation: 

StoreFPR(fd, W, ConvertFmt (ValueFPR ( f s , fmt), fmt, W) ) 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


307 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




Floating Point Truncate to Word Fixed Point (cont.) 


TRUNC.W.fmt 


Exceptions: 

Coprocessor Unusable, Reserved Instruction 

Floating Point Exceptions: 

Inexact, Invalid Operation, Overflow, Unimplemented Operation 
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WAIT 


Enter Standby Mode 


31 26 25 24 6 5 0 


COPO 

010000 

CO 

1 

Implementation-Dependent Code 

WAIT 

100000 

6 

1 

19 

6 

Format: wait 



MIPS32 


Purpose: 

Wait for Event 

Description: 

The WAIT instruction performs an implementation-dependent operation, usually involving a lower power mode. 
Software may use bits 24:6 of the instruction to communicate additional information to the processor, and the proces- 
sor may use this information as control for the lower power mode. A value of zero for bits 24:6 is the default and must 
be valid in all implementations. 

The WAIT instruction is typically implemented by stalling the pipeline at the completion of the instruction and enter- 
ing a lower power mode. The pipeline is restarted when an external event, such as an interrupt or external request 
occurs, and execution continues with the instruction following the WAIT instruction. It is implementation-dependent 
whether the pipeline restarts when a non-enabled interrupt is requested. In this case, software must poll for the cause 
of the restart. The assertion of any reset or NMI must restart the pipeline and the corresponding exception must be 
taken. 

If the pipeline restarts as the result of an enabled interrupt, that interrupt is taken between the WAIT instruction and 
the following instruction (EPC for the interrupt points at the instruction following the WAIT instruction). 

Restrictions: 

The operation of the processor is UNDEFINED if a WAIT instruction is placed in the delay slot of a branch or a 
jump. 

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled. 
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Enter Standby Mode (cont.) 


WAIT 


Operation: 

I : Enter implementation dependent lower power mode 

1+1:/* Potential interrupt taken here */ 

Exceptions: 

Coprocessor Unusable Exception 
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WRPGPE 


Write to GPR in Previous Shadow Set 


31 

26 

25 21 

20 


16 

15 


11 

10 

0 

COPO 
0100 00 

WRPGPR 
01 110 

rt 

rd 

0 

000 0000 0000 

6 


5 


5 



5 



11 

Format: 

WRPGPR rd, rt 








MIPS32 Release 2 

Purpose: 












To move the contents of a current GPR to a GPR in the previous shadow set. 

Description: SGPR[SRSCti PSS , rd] 4- rt 

The contents of the current GPR rt is moved to the shadow GPR register specified by SRSCtl PSS (signifying the pre- 
vious shadow set number) and rd (specifying the register number within that set). 

Restrictions: 

In implementations prior to Release 2 of the Architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

Operation: 

SGPR [ SRSCtlp SS , rd] <- GPR[rt] 

Exceptions: 

Coprocessor Unusable 
Reserved Instruction 
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WSBH 


Word Swap Bytes Within Halfwords 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL3 

0 



WSBH 

BSHFL 



rt 

rd 



011111 

00000 



00010 

100000 


6 5 

5 

5 

5 

6 

Format: wsbh rd, rt 




MIPS32 Release 2 

Purpose: 






To swap the bytes within each halfword of GPR rt and store the value into GPR rd. 

Description: rd <— SwapBytesWithinHalfwords (rt) 

Within each halfword of GPR rt the bytes are swapped, and stored in GPR rd. 

Restrictions: 

In implementations prior to Release 2 of the architecture, this instruction resulted in a Reserved Instruction Excep- 
tion. 

Operation: 

GPR [ rd] <— GPR [ rt 1 23. .16 II GPR [ rt ] 31 _ _ 24 || GPR [rt ] 7 . . 0 || GPR [ rt ] 15 . . 8 

Exceptions: 

Reserved Instruction 

Programming Notes: 

The WSBH instruction can be used to convert halfword and word data of one endianness to another endianness. The 
endianness of a word value can be converted using the following sequence: 


lw 

to. 

0 (al) 

/* 

Read word value */ 

wsbh 

to, 

to 

/* 

Convert endiannes of the halfwords */ 

rotr 

to, 

to, 16 

/* 

Swap the halfwords within the words */ 


Combined with SEH and SRA, two contiguous halfwords can be loaded from memory, have their endianness con- 
verted, and be sign-extended into two word values in four instructions. For example: 


lw 

to. 

0 (al) 

/* 

Read 

two contiguous halfwords */ 



wsbh 

to, 

to 

/* 

Convert endiannes of the halfwords 

*/ 


seh 

tl. 

to 

/* 

tl = 

lower halfword sign-extended 

to word 

*/ 

sra 

to, 

to, 16 

/* 

to = 

upper halfword sign-extended 

to word 

*/ 


Zero-extended words can be created by changing the SEH and SRA instructions to ANDI and SRL instructions, 
respectively. 


312 


MIPS32™ Architecture For Programmers Volume II, Revision 2.00 


Copyright © 2001-2003 MIPS Technologies Inc. All rights reserved. 




Exclusive OR 


XOR 


31 26 25 21 20 16 15 11 10 6 5 0 


SPECIAL 




0 

XOR 


rs 

rt 

rd 



000000 




00000 

100110 


6 5 

5 

5 

5 

6 

Format: XOR rd, rs, rt 

Purpose: 

To do a bitwise logical Exclusive OR 

Description: rd rs xor rt 




MIPS32 


Combine the contents of GPR rs and GPR rt in a bitwise logical Exclusive OR operation and place the result into 
GPR rd. 

Restrictions: 

None 

Operation: 

GPR [ rd] 4- GPR [ rs ] xor GPR[rt] 

Exceptions: 

None 
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XORI 


Exclusive OR Immediate 


31 

26 

25 

21 

20 16 

15 

0 

XORI 

001110 



rs 

rt 

immediate 


6 



5 

5 

16 


Format: 

Purpose: 

XORI 

rt, 

rs, immediate 


MIPS32 


To do a bitwise logical Exclusive OR with a constant 

Description: rt < — rs XOR immediate 

Combine the contents of GPR rs and the 16-bit zero-extended immediate in a bitwise logical Exclusive OR operation 
and place the result into GPR rt. 

Restrictions: 

None 

Operation: 

GPR[rt] GPRfrs] xor zero_extend (immediate) 

Exceptions: 

None 
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Appendix A 


Instruction Bit Encodings 


A.l Instruction Encodings and Instruction Classes 

Instruction encodings are presented in this section; field names are printed here and throughout the book in italics. 

When encoding an instruction, the primary opcode field is encoded first. Most opcode values completely specify an 
instruction that has an immediate value or offset. 

Opcode values that do not specify an instruction instead specify an instruction class. Instructions within a class are 
further specified by values in other fields. For instance, opcode REGIMM specifies the immediate instruction class, 
which includes conditional branch and trap immediate instructions. 


A.2 Instruction Bit Encoding Tables 

This section provides various bit encoding tables for the instructions of the MIPS32 ISA. 

Figure A-l shows a sample encoding table and the instruction opcode field this table encodes. Bits 31. .29 of the opcode 
field are listed in the leftmost columns of the table. Bits 28.. 26 of the opcode field are listed along the topmost rows of 
the table. Both decimal and binary values are given, with the first three bits designating the row, and the last three bits 
designating the column. 

An instruction’s encoding is found at the intersection of a row (bits 3 1 ..29) and column (bits 28. .26) value. For instance, 
the opcode value for the instruction labelled EX1 is 33 (decimal, row and column), or 01 101 1 (binary). Similarly, the 
opcode value for EX2 is 64 (decimal), or 110100 (binary). 
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Appendix A Instruction Bit Encodings 


31 


26 25 


21 20 


16 15 


opcode 

rs 

rt 

immediate 



16 


Binary encoding of 
opcode (28. .26) 


Decimal encoding of 
opcode (28. .26) 


opcode bits 28. .26 


7 


bits 31.. 29 

0 

1 

2 

3 

4 

5 

6 

' 7 ^ 

000 

001 

010 

011 

100 

101 

110 

111 ’ 

0 

000 









1 

001 









2 

010 









3 

Oil 




EX1 





4 

100 









5 

101 









6 

110 





EX2 





111, 

» 









Decimal encoding of 
opcode (31 ..29) 


'Binary encoding of 
opcode (31 ..29) 


Figure A-l Sample Bit Encoding Table 

Tables A-2 through A-20 describe the encoding used for the MIPS32 ISA. Table A-l describes the meaning of the 
symbols used in the tables. 


Table A-l Symbols Used in the Instruction Encoding Tables 


Symbol 

Meaning 

* 

Operation or field codes marked with this symbol are reserved for future use. Executing such an 
instruction must cause a Reserved Instruction Exception. 

6 

(Also italic field name.) Operation or field codes marked with this symbol denotes a field class. 
The instruction word must be further decoded by examining additional tables that show values for 
another instruction field. 

P 

Operation or field codes marked with this symbol represent a valid encoding for a higher-order 
MIPS ISA level or a new revision of the Architecture. Executing such an instruction must cause a 
Reserved Instruction Exception. 

V 

Operation or field codes marked with this symbol represent instructions which were only legal if 
64-bit operations were enabled on implementations of Release 1 of the Architecture. In Release 2 
of the architecture, operation or field codes marked with this symbol represent instructions which 
are legal if 64-bit floating point operations are enabled. In other cases, executing such an 
instruction must cause a Reserved Instruction Exception (non-coprocessor encodings or 
coprocessor instruction encodings for a coprocessor to which access is allowed) or a Coprocessor 
Unusable Exception (coprocessor instruction encodings for a coprocessor to which access is not 
allowed). 
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A. 2 Instruction Bit Encoding Tables 


Table A-l Symbols Used in the Instruction Encoding Tables 


Symbol 

Meaning 

0 

Operation or field codes marked with this symbol are available to licensed MIPS partners. To 
avoid multiple conflicting instruction definitions, MIPS Technologies will assist the partner in 
selecting appropriate encodings if requested by the partner. The partner is not required to consult 
with MIPS Technologies when one of these encodings is used. If no instruction is encoded with 
this value, executing such an instruction must cause a Reserved Instruction Exception ( SPECIAL 2 
encodings or coprocessor instruction encodings for a coprocessor to which access is allowed) or 
a Coprocessor Unusable Exception (coprocessor instruction encodings for a coprocessor to which 
access is not allowed). 

a 

Field codes marked with this symbol represent an EJTAG support instruction and implementation 
of this encoding is optional for each implementation. If the encoding is not implemented, 
executing such an instruction must cause a Reserved Instruction Exception. If the encoding is 
implemented, it must match the instruction encoding as shown in the table. 

8 

Operation or field codes marked with this symbol are reserved for MIPS Application Specific 
Extensions. If the ASE is not implemented, executing such an instruction must cause a Reserved 
Instruction Exception. 

<i> 

Operation or field codes marked with this symbol are obsolete and will be removed from a future 
revision of the MIPS32 ISA. Software should avoid using these operation or field codes. 

© 

Operation or field codes marked with this symbol are valid for Release 2 implementations of the 
architecture. Executing such an instruction in a Release 1 implementation must cause a Reserved 
Instruction Exception. 


Table A-2 MIPS32 Encoding of the Opcode Field 


opcode 

bits 28.. 26 

bits 31. .29 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

011 

100 

101 

110 

111 

o 

000 

SPECIAL S 

REGIMM 8 

J 

JAL 

BEQ 

BNE 

BT.F.Z 

BGTZ 

1 

001 

ADDI 

ADDIU 

SLTI 

SLTIU 

ANDI 

ORI 

XORI 

LUI 

2 

010 

COPO 8 

COP1 8 

copi es 

COP1X 1 8 

BEQL <() 

BNEL <f> 

BLEZL (|> 

BGTZL ij) 

3 

Oil 

P 

P 

P 

P 

SPECIAL2 8 

JALXe 

e 

SPECIALS 2 

80 

4 

100 

LB 

LH 

LWL 

LW 

LBU 

LHU 

LWR 

P 

m 

101 

SB 

SH 

SWL 

SW 

P 

P 

SWR 

CACHE 

6 

110 

LL 

LWC1 

LWC2 9 

PREF 

P 

LDC1 

LDC2 0 

P 

7 

111 

SC 

SWC1 

SWC2 0 

* 

P 

SDC1 

SDC2 0 

P 


1. In Release 1 of the Architecture, the COP1X opcode was called COP3, and was available as another user-available coprocessor. In 

Release 2 of the Architecture, a full 64-bit floating point unit is available with 32-bit CPUs, and the COP1X opcode is reserved for 
that purpose on all Release 2 CPUs. 32-bit implementations of Release 1 of the architecture are strongly discouraged from using 
this opcode for a user-available coprocessor as doing so will limit the potential for an upgrade path to a 64-bit floating point unit. 

2. Release 2 of the Architecture added the SPECIAL3 opcode. Implementations of Release 1 of the Architecture signaled a Reserved 

Instruction Exception for this opcode. 
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Table A-3 MIPS32 SPECIAL Opcode Encoding of Function Field 


function 

bits 2..0 

bits 5.. 3 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

011 

100 

101 

110 

111 

0 

000 

SLL 1 

MOVCI 5 

SRL 8 

SRA 

SLLV 

* 

SRLV 8 

SRAV 

1 

001 

JR 2 

JALR 2 

MOVZ 

MOVN 

SYSCALL 

BREAK 

* 

SYNC 

2 

010 

MFHI 

MTHI 

MFLO 

MTLO 

P 

* 

P 

P 

3 

Oil 

MULT 

MULTU 

DIV 

DIVU 

P 

P 

P 

P 

4 

100 

ADD 

ADDU 

SUB 

SUBU 

AND 

OR 

XOR 

NOR 

5 

101 

* 

* 

SLT 

SLTU 

P 

P 

P 

P 

6 

no 

TGE 

TGEU 

TLT 

TLTU 

TEQ 

* 

TNE 

* 

7 

in 

P 

* 

P 

P 

P 

* 

P 

P 


1. Specific encodings of the rt, rd, and sa fields are used to distinguish among the SLL, NOP, SSNOP and EHB functions. 

2. Specific encodings of the hint field are used to distinguish JR from JR.HB and JALR from JALR.HB 


Table A-4 MIPS32 REGIMM Encoding of rt Field 


rt 

bits 18.. 16 

bits 20.. 19 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

011 

100 

101 

110 

111 

0 

00 

BLTZ 

BGEZ 

BLTZL (|> 

BGEZL $ 

* 

* 

* 

* 

1 

01 

TGEI 

TGEIU 

TLTI 

TLTIU 

TEQI 

* 

TNEI 

* 

2 

10 

BLTZAL 

BGEZ AT. 

BLTZ ALL (|) 

BGEZALL <|> 

* 

* 

* 

* 

3 

11 

* 

* 

* 

* 

* 

* 

* 

SYNCI ® 


Table A-5 MIPS32 SPECIAL2 Encoding of Function Field 


function 

bits 2..0 

bits 5.. 3 

0 

i 

2 

3 

4 

5 

6 

7 

000 

001 

010 

Oil 

100 

101 

no 

111 

0 

000 

MADD 

MADDU 

MUL 

0 

MSUB 

MSUBU 

0 

0 

1 

001 

e 

e 

0 

0 

0 

0 

0 

0 

2 

010 

e 

e 

0 

0 

0 

0 

0 

0 

3 

Oil 

e 

e 

0 

0 

0 

0 

0 

0 

4 

100 

CLZ 

CLO 

0 

0 

P 

P 

0 

0 

5 

101 

e 

0 

0 

0 

0 

0 

0 

0 

6 

no 

e 

0 

0 

0 

0 

0 

0 

0 

7 

in 

e 

0 

0 

0 

0 

0 

0 

SDBBP 0 


Table A-6 MIPS32 SPECIAL3 1 Encoding of Function Field for Release 2 of the Architecture 


function 

bits 2..0 

bits 5.. 3 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

Oil 

100 

101 

no 

111 

0 

000 

EXT® 

P 

P 

P 

INS ffi 

P 

P 

P 

1 

001 

* 

* 

* 

* 

* 

* 

* 

* 

2 

010 

* 

* 

* 

* 

* 

* 

* 

* 

3 

on 

* 

* 

* 

* 

* 

* 

* 

* 

4 

100 

BSHFL ®8 

* 

* 

* 

P 

* 

* 

* 

5 

101 

* 

* 

* 

* 

* 

* 

* 

* 

6 

no 

* 

* 

* 

* 

* 

* 

* 

* 

7 

in 

* 

* 

* 

RDHWR ffi 

* 

* 

* 

* 
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A. 2 Instruction Bit Encoding Tables 


1. Release 2 of the Architecture added the SPECIAL3 opcode. Implementations of Release 1 of the Architecture signaled a Reserved 
Instruction Exception for this opcode and all function field values shown above. 


Table A-7 MIPS32 MOV Cl Encoding of tf Bit 


tf 

bit 16 


0 

1 

MOVF 

MOVT 


Table A-8 MIPS32 1 SRL Encoding of Shift/Rotate 


R 

bit 21 


0 

1 

SRL 

ROTR 


1 . Release 2 of the Architecture added the 
ROTR instruction. Implementations 
of Release 1 of the Architecture ig- 
nored bit 21 and treated the instruc- 
tion as an SRL 


Table A-9 MIPS32 1 SRLV Encoding of Shift/Rotate 


R 

bit 6 


0 

1 

SRLV 

ROTRV 


1 . Release 2 of the Architecture added the 
ROTRV instruction. Implementa- 
tions of Release 1 of the Architecture 
ignored bit 6 and treated the instruc- 
tion as an SRLV 


Table A-10 MIPS32 BSHFL Encoding of sa Field 1 


sa 

bits 8.. 6 

bits 10.. 9 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

011 

100 

101 

110 

111 

0 

00 



WSBH 






1 

01 









2 

10 

SEB 








3 

11 

SEH 









1. The sa field is sparsely decoded to identify the final instructions. Entries in this table with no mnemonic are reserved for future use 
by MIPS Technologies and may or may not cause a Reserved Instruction exception. 


Table A-ll MIPS32 COPO Encoding of rs Field 


rs 

bits 2 3.. 21 

bits 2 5.. 24 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

011 

100 

101 

110 

111 

0 

00 

MFC0 

p 

* 

* 

MTC0 

P 

* 

* 

1 

01 

* 

* 

RDPGPR © 

MFMC0 1 8© 

* 

* 

WRPGPR © 

* 

2 

10 

COS 

3 

11 
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1. Release 2 of the Architecture added the MFMCO function, which is further decoded as the DI and El instructions. 


Table A-12 MIPS32 COPO Encoding of Function Field When rs =CO 


function 

bits 2..0 

bits 5.. 3 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

011 

100 

101 

110 

111 

0 

000 

* 

TLBR 

TLBWI 

* 

* 

* 

TLBWR 

* 

1 

001 

TLBP 

* 

* 

* 

* 

* 

* 

* 

2 

010 

* 

* 

* 

* 

* 

* 

* 

* 

3 

Oil 

ERET 

* 

* 

* 

* 

* 

* 

DERET a 

4 

100 

WAIT 

* 

* 

* 

* 

* 

* 

* 

5 

101 

* 

* 

* 

* 

* 

* 

* 

* 

6 

110 

* 

* 

* 

* 

* 

* 

* 

* 

7 

111 

* 

* 

* 

* 

* 

* 

* 

* 


Table A-13 MIPS32 COP l Encoding of rs Field 


rs 

bits 2 3.. 21 

bits 25.. 24 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

Oil 

100 

101 

110 

111 

0 

00 

MFC1 

p 

CFC1 

MFHC1 © 

MTC1 

P 

CTC1 

MTHC1 © 

1 

01 

BC1 S 

BC1ANY2 SeV 

BC1ANY4 SeV 

* 

* 

* 

* 

* 

— 

10 

S 8 

D S 

* 

* 

W5 

LS 

PS 5 

* 

~ 

11 

* 

* 

* 

* 

* 

* 

* 

* 


Table A-14 MIPS32 CO PI Encoding of Function Field When rs=S 


function 

bits 2..0 


0 

i 

2 

3 

4 

5 

6 

7 


001 

010 

Oil 

100 

101 

110 

111 

B 

000 

ADD 

SUB 

MUL 

DIV 

SQRT 

ABS 

MOV 

NEG 

i 

001 

ROUND.L V 

TRUNC.L V 

CEIL.L V 

FLOOR.L V 

ROUND. W 

TRUNC.W 

CEIL.W 

FLOOR. W 

B 

010 

* 

MOVCF 8 

MOVZ 

MOVN 

* 

RECIP V 

RSQRT V 

* 

B 

Oil 

* 

* 

* 

* 

RECIP2 eV 

RECIP1 eV 

RSQRT1 eV 

RSQRT2 eV 

4 

100 

* 

CVT.D 

* 

* 

CVT.W 

CVT.L V 

CVT.PSV 

* 

B 

101 

* 

* 

* 

* 

* 

* 

* 

* 

6 

110 

C.F 

CABS.F eV 

C.UN 

CABS.UN eV 

C.EQ 

CABS.EQ eV 

C.UEQ 

CABS.UEQ eV 

COLT 

CABS.OLTeV 

C.ULT 

CABS.ULTeV 

C.OLE 

CABS.OLE eV 

C.ULE 

CABS.ULE eV 

7 

111 

C.SF 

CABS.SF £V 

C.NGLE 

CABS.NGLEeV 

C.SEQ 

CABS.SEQ eV 

C.NGL 

CABS.NGL eV 

C.LT 

CABS.LT eV 

C.NGE 

CABS.NGE eV 

C.LE 

CABS.LE eV 

C.NGT 

CABS.NGT eV 
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A. 2 Instruction Bit Encoding Tables 


Table A-15 MIPS32 COP1 Encoding of Function Field When rs =D 


function 

bits 2..0 


0 

i 

2 

3 

4 

5 

6 

7 

000 

001 

010 

Oil 

100 

101 

110 

111 

D 

000 

ADD 

SUB 

MUL 

DIV 

SQRT 

ABS 

MOV 

NEG 

i 

001 

ROUND.L V 

TRUNC.L V 

CEIL.L V 

FLOOR.L V 

ROUND. W 

TRUNC.W 

CEIL.W 

FLOOR. W 

H 

010 

* 

MOVCF 8 

MOVZ 

MOVN 

* 

RECIP V 

RSQRT V 

* 

H 

Oil 

* 

* 

* 

* 

RECIP2 eV 

RECIP1 eV 

RSQRT1 eV 

RSQRT2 eV 

4 

100 

CVT.S 

* 

* 

* 

CVT.W 

CVT.L V 

* 

* 

H 

101 

* 

* 

* 

* 

* 

* 

* 

* 

D 

110 

C.F 

CABS.F eV 

C.UN 

CABS. UN eV 

C.EQ 

CABS.EQ eV 

C.UEQ 

CABS.UEQ eV 

COLT 

CABS.OLTeV 

C.ULT 

CABS.ULTeV 

C.OLE 

CABS. OLE eV 

C.ULE 

CABS.ULE eV 

7 

111 

C.SF 

CABS.SF £V 

C.NGLE 
CABS.NGLE eV 

C.SEQ 

CABS.SEQ eV 

C.NGL 

CABS.NGL eV 

C.LT 

CABS.LT eV 

C.NGE 

CABS.NGE eV 

C.LE 

CABS.LE £V 

C.NGT 

CABS.NGT eV 


Table A-16 MIPS32 COP1 Encoding of Function Field When rs= W or L 1 


function 

bits 2..0 

bits 5.. 3 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

011 

100 

101 

110 

111 

0 

000 

* 

* 

* 

* 

* 

* 

* 

* 

1 

001 

* 

* 

* 

* 

* 

* 

* 

* 

2 

010 

* 

* 

* 

* 

* 

* 

* 

* 

3 

Oil 

* 

* 

* 

* 

* 

* 

* 

* 

4 

100 

CVT.S 

CVT.D 

* 

* 

* 

* 

CVT.PS.PW eV 

* 

5 

101 

* 

* 

* 

* 

* 

* 

* 

* 

6 

110 

* 

* 

* 

* 

* 

* 

* 

* 

7 

111 

* 

* 

* 

* 

* 

* 

* 

* 


1 . Format type L is legal only if 64-bit floating point operations are enabled. 


Table A-17 MIPS64 COP1 Encoding of Function Field When rs=PS 1 


function 

bits 2..0 

bits 5.. 3 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

011 

100 

101 

110 

111 

0 

000 

ADD V 

SUB V 

MUL V 

* 

* 

ABS V 

MOW 

NEG V 

1 

001 

* 

* 

* 

* 

* 

* 

* 

* 

2 

010 

* 

MOVCF 8V 

MOVZ V 

MOVN V 

* 

* 

* 

* 

3 

Oil 

ADDR eV 

* 

MULR eV 

* 

RECIP2 eV 

RECIP 1 eV 

RSQRT 1 eV 

RSQRT2 eV 

4 

100 

CVT.S. PU V 

* 

* 

* 

CVT.PW.PS eV 

* 

* 

* 

5 

101 

CVT.S.PL V 

* 

* 

* 

PLL.PS V 

PLU.PS V 

PUL PS V 

PUU.PS V 

6 

110 

C.F V 
CABS.F eV 

C.UN V 
CABS. UN eV 

C.EQ V 
CABS.EQ eV 

C.UEQ V 
CABS.UEQ eV 

C.OLT V 
CABS.OLT eV 

C.ULT V 
CABS.ULT eV 

C.OLE V 
CABS.OLEeV 

C.ULE V 
CABS.ULE eV 

7 

111 

C.SF V 
CABS.SF eV 

C.NGLE V 
CABS.NGLEeV 

C.SEQ V 
CABS.SEQ eV 

C.NGL V 
CABS.NGL eV 

C.LT V 
CABS.LT eV 

C.NGE V 
CABS.NGE eV 

r T F V 
CABS.LE £V 

C.NGT V 
CABS.NGT eV 


1 . Format type PS is legal only if 64-bit floating point operations are enabled. 


Table A-18 MIPS32 COP1 Encoding of tf Bit When rs=S, D, or PS, Function=M0FCF 


tf 

bit 16 


0 

1 

MOVF.fmt 

MOVT.fmt 
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Table A-19 MIPS32 COP2 Encoding of rs Field 


rs 

bits 23.. 21 

bits 2 5.. 24 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

011 

100 

101 

110 

111 

0 

00 

MFC2 9 

P 

CFC2 0 

MFHC2 0© 

MTC2 0 

P 

CTC2 0 

MTHC2 0© 

1 

01 

BC2 0 

* 

* 

* 

* 

* 

* 

* 

2 

10 

C2 08 

3 

11 


Table A-20 MIPS64 COP1X Encoding of Function Field 1 


function 

bits 2..0 

bits 5.. 3 

0 

1 

2 

3 

4 

5 

6 

7 

000 

001 

010 

011 

100 

101 

110 

111 

0 

000 

LWXC1 V 

LDXC1 V 

* 

* 

* 

LUXC1 V 

* 

* 

1 

001 

SWXC1 V 

SDXC1 V 

* 

* 

* 

SUXC1 V 

* 

PREFX V 

2 

010 

* 

* 

* 

* 

* 

* 

* 

* 

3 

Oil 

* 

* 

* 

* 

* 

* 

ALNV.PS V 

* 

4 

100 

MADD.S V 

MADD.D V 

* 

* 

* 

* 

MADD.PS V 

* 

5 

101 

MSUB.S V 

MSUB.D V 

* 

* 

* 

* 

MSUB.PS V 

* 

6 

110 

NMADD.S V 

NMADD.D V 

* 

* 

* 

* 

NMADD.PS V 

* 

7 

111 

NMSUB.S V 

NMSUB.D V 

* 

* 

* 

* 

NMSUB.PS V 

* 


1. COP1X instructions are legal only if 64-bit floating point operations are enabled. 


A.3 Floating Point Unit Instruction Format Encodings 

Instruction format encodings for the floating point unit are presented in this section. This information is a tabular 
presentation of the encodings described in tables Table A- 13 and Table A-20 above. 


Table A-21 Floating Point Unit Instruction Format Encodings 


fmt field 
(bits 25..21 of 
COP1 opcode) 

fmt3 field 
(bits 2..0 of 
COP1X opcode) 

Mnemonic 

Name 

Bit Width 

Data Type 

Decimal 

Hex 

Decimal 

Hex 

0..15 

00. .OF 

— 

— 

Used to encode Coprocessor 1 interface instructions (MFC1, 
CTC1, etc.). Not used for format encoding. 

16 

10 

0 

0 

S 

Single 

32 

Floating 

Point 

17 

11 

1 

1 

D 

Double 

64 

Floating 

Point 

18.. 19 

12.. 13 

2. .3 

2. .3 

Reserved for future use by the architecture. 

20 

14 

4 

4 

W 

Word 

32 

Fixed Point 

21 

15 

5 

5 

L 

Long 

64 

Fixed Point 

22 

16 

6 

6 

PS 

Paired 

Single 

2x32 

Floating 

Point 

23 

17 

7 

7 

Reserved for future use by the architecture. 
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A. 3 Floating Point Unit Instruction Format Encodings 


Table A-21 Floating Point Unit Instruction Format Encodings 


fmt field 
(bits 25..21 of 
COP1 opcode) 

fmt3 field 
(bits 2..0 of 
COP1X opcode) 

Mnemonic 

Name 

Bit Width 

Data Type 

Decimal 

Hex 

Decimal 

Hex 

24. .31 

18.. IF 

— 

— 

Reserved for future use by the architecture. Not available for 
fmt 3 encoding. 
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Appendix B 


Revision History 


In the left hand page margins of this document you may find vertical change bars to note the location of significant 
changes to this document since its last release. Significant changes are defined as those which you should take note of 
as you use the MIPS IP. Changes to correct grammar, spelling errors or similar may or may not be noted with change 
bars. Change bars will be removed for changes which are more than one revision old. 

Please note: Limitations on the authoring tools make it difficult to place change bars on changes to figures. Change bars 
on figure titles are used to denote a potential change in the figure itself. 


Revision 

Date 

Description 

0.90 

November 1, 2000 

Internal review copy of reorganized and updated architecture documentation. 

0.91 

November 15, 2000 

External review copy of reorganized and updated architecture documentation. 



Changes in this revision: 

0.92 

December 15, 2000 

• Correct sign in description of MSUBU. 



• Update JR and JALR instructions to reflect the changes required by 
MIPS 16. 

0.95 

March 12, 2001 

Update for second external review release. 


Updated based on feedback from all reviews. 


• Add missing optional select field syntax in mtcO/mfcO instruction 
descriptions. 

• Correct the PREF instruction description to acknowledge that the 
PrepareForStore function does, in fact, modify architectural state. 

• To provide additional flexibility for Coprocessor 2 implementations, extend 
the sel field for DMFCO. DMTCO. MFCO, and MTCO to be 8 bits. 

• Update the PREF instruction to note that it may not update the state of a 
locked cache line. 

• Remove obviously incorrect documentation in DIV and DIVU with regard 
to putting smaller numbers in register rt. 

• Fix the description for MFC2 to reflect data movement from the 
coprocessor 2 register to the GPR, rather than the other way around. 

1.00 August 29, 2002 • Correct the pseudo code for LDC1, LDC2. SDC1, and SDC2 for a MIPS32 

implementation to show the required word swapping. 

• Indicate that the operation of the CACHE instruction is UNPREDICTABLE 
if the cache line containing the instruction is the target of an invalidate or 
writeback invalidate. 

• Indicate that an Index Load Tag or Index Store Tag operation of the 
CACHE instruction must not cause a cache error exception. 

• Make the entire right half of the MFC2, MTC2. CFC2, CTC2, DMFC2, and 
DMTC2 instructions implementation dependent, thereby acknowledging 
that these fields can be used in any way by a Coprocessor 2 implementation. 

• Clean up the definitions of LL, SC, LLD. and SCD. 

• Add a warning that software should not use non-zero values of the stype 
field of the SYNC instruction. 

• Update the compatibility and subsetting rules to capture the current 
requirements. 
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Revision 


1.90 


2.00 


Date Description 

Merge the MIPS Architecture Release 2 changes in for the first release of a 

Relesae 2 processor. Changes in this revision include: 

• All new Release 2 instructions have been included: DI. EHB, El, EXT, INS, 
JALR.HB, JR.HB, MFHC1, MFHC2, MTHC1, MTHC2, RDHWR, 
RDPGPR, ROTR, ROTRV, SEB, SEH, SYNCI, WRPGPR, WSBH. 

• The following instruction definitions changed to reflect Release 2 of the 
Architecture: DERET, ERET. JAL, JALRMR, SRL, SRLV 

September 1, 2002 

• With support for 64-bit FPUs on 32-bit CPUs in Release 2, all floating point 
instructions that were previously implemented by MIPS64 processors have 
been modified to reflect support on either MIPS32 or MIPS64 processors in 
Release 2. 

• All pseudo-code functions have been udpated, and the 
Are64bitFPOperationsEnabled function was added. 

• Update the instruction encoding tables for Release 2. 

Continue with updates to merge Release 2 changes into the document. Changes 

in this revision include: 

• Correct the target GPR (from rd to rt) in the SLTI and SLTIU instructions. 
This appears to be a day-one bug. 

• Correct CPR number, and missing data movement in the pseudocode for the 
MTC0 instruction. 

• Add note to indicate that the CACHE instruction does not take Address 
Error Exceptions due to mis-aligned effective addresses. 

• Update SRL, ROTR, SRLV, ROTRV, DSRL, DROTR, DSRLV, 

T Q mo t DROTRV, DSRL32, and DROTR32 instructions to reflect a 1-bit, rather 

" l “" ' than a 4-bit decode of shift vs. rotate function. 

• Add programming note to the PrepareForStore PREF hint to indicate that it 
can not be used alone to create a bzero-like operation. 

• Add note to the PREF and PREFX instruction indicating that they may 
cause Bus Error and Cache Error exceptions, although this is typically 
limited to systems with high-reliability requirements. 

• Update the SYNCI instruction to indicate that it should not modify the state 
of a locked cache line. 

• Establish specific rules for when multiple TLB matches can be reported (on 
writes only). This makes software handling easier. 
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